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PREFACE 


This book is intended for a one-year undergraduate course in abstract algebra. 
Its design is such that the book can also be used for a one-semester course. 
The book contains more material than normally would be taught in a one-year 
course. This should give the teacher flexibility with respect to the selection of 
the content and the level at which the book is to be used. We give a rigorous 
treatment of the fundamentals of abstract algebra with numerous examples 
to illustrate the concepts. It usually takes students some time to become 
comfortable with the seeming abstractness of modern algebra. Hence we begin 
at a leisurely pace paying great attention to the clarity of our proofs. The only 
real prerequisite for the course is the appropriate mathematical maturity of 
the students. Although the material found in calculus is independent of that 
of abstract algebra, a year of calculus is typically given as a prerequisite. Since 
many of the examples in algebra comes from matrices, we assume that the 
reader has some basic knowledge of matrix theory. The book should prepare 
the student for higher level mathematics courses and computer science courses. 
We have many problems of varying difficulty appearing after each section. We 
occasionally leave as an exercise the verification of a certain point in a proof. 
However, we do not rely on exercises to introduce concepts which will be needed 
later on in the text. 

Topics are introduced that have never appeared in this type of textbook. 
They include Grobner basis, rings of matrices, and Noetherian and Artinian 
rings. Another distinguishing feature of the book is the Worked-Out Exercises 
which appear after every section. These Worked-Out Exercises provide not 
only techniques of problem solving, but also supply additional information to 
enhance the level of knowledge of the reader. For example, in Chapter 7, we 
illustrate several techniques that are very effective in determining the Sylow 
subgroups of a group, whether the group is simple or not, and in determining 
the structure of a group. In Chapter 9, we give numerous examples and show 
how to determine different Abelian groups of a given order. We also show how 
to find the elementary divisors, the torsion coefficients, and the betti number 
of a finitely generated Abelian group. In Chapter 15, we give an algorithmic 
procedure to find the greatest common divisor and illustrate it in full detail. 





xiv 


We also illustrate how to show whether an element is prime and/or irreducible. 
In Chapter 24, we give numerous examples to show how to determine the Galois 
group and the intermediate fields of a Galois field extension. Of course, each 
section is followed by problems of varying difficulty for the reader to further 
master the subject. The reader should study the Worked-Out Exercises that 
are marked with <C> along with the chapter. Those not marked with <0> may be 
skipped during the first reading. Sprinkled throughout the book are comments 
dealing with the historical development of abstract algebra. 

This book has been class-tested at Creighton University and at the Univer¬ 
sity of Calcutta. During preparation of the manuscript, we used an approach 
which would help students who need a text to pass different types of aptitude 
tests in algebra. 

In Chapter 1, the necessary ideas of sets, relations, functions, and binary 
operations are presented. We recommend that the chapter be gone through 
quickly in order to provide enough time to cover essential topics from abstract 
algebra. The students can refer back to material omitted on the first pass, as 
needed. For example, Zorn’s lemma may be omitted on the first reading. It is 
not needed until Chapter 17. 

Chapters 2 through 6 contain basic results on group theory. Most of the 
material in these chapters should be covered in the first semester. Chapters 10 
through 14 contain basic results on ring theory. Most of the results in these 
chapters should also be covered in the first semester. 

The second semester should cover Chapters 15 through 17. These chapters 
deal with Euclidean domains, unique factorization domains, and prime and 
maximal ideals. Students should now be well prepared to study field theory in 
the remaining part of the semester. Those who have not had a course on linear 
algebra should spend some time on vector spaces in Chapter 20. The students 
should finish the semester with Chapter 21 and as much of Chapter 22 through 
24 as possible. There is plenty of material remaining from which special topics 
may be chosen. 

We have included chapters on coding theory and Grobner bases so that 
the student can gain some appreciation of the applications of abstract algebra. 
The chapter on coding theory contains enough material to allow the student 
to see applications of groups, ideals, and fields. We present a chapter on 
Grobner bases because of its currency. It can be a first step into the area of 
computational algebra. The chapter also provides important applications of 
commutative algebra. 

We would like to thank Professor James K. Deveney of Virginia Common¬ 
wealth University and his abstract algebra class for their valuable suggestions. 
We express our sincere gratitude to Fr. Michael Proterra, Dean, Creighton Col¬ 
lege of Arts and Sciences, for making possible Dr. Sen’s visit during 1992-1993. 


We would like to thank Dr. Mark J. Wierman for showing us many important 

Pitte 7^o^Lc#fta£uiai P4y.J.ZcJ. 



XV 


features of LaTex which were very helpful in preparing this manuscript in its 
present form and also for drawing all diagrams in the book. In addition, we 
express our sincere thanks to Dr. T.K. Mukherjee, Dr. S. Ganguly, and Dr. 
S.R. Lopez-Permouth for their critical comments. We are very thankful to our 
families for their constant support and encouragement throughout this project. 
We would like to give special thanks to Shelly Malik, who constantly inquired 
about the manuscript and counted each chapter every time the manuscript 
was printed. Finally we would like to thank Karen Minette of McGraw-Hill for 
making this project a success. 

We welcome any comments concerning the text. The comments may be 
forwarded to the following e-mail addresses: malik@bluejay.creighton.edu or 
mordes @bluej ay. creighton. edu 


D. S. Malik 
John N. Mordeson 
M. K. Sen 
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Chapter 1 

Sets, Relations, and Integers 


The purpose of this introductory chapter is mainly to review briefly some famil¬ 
iar properties of sets, functions, and number theory. Although most of these 
properties are familiar to the reader, there are certain concepts and results 
which are basic to the understanding of the body of the text. 

This chapter is also used to set down the conventions and notations to be 
used throughout the book. Sets will always be denoted by capital letters. For 
example, we use the notation N for the set of positive integers, Z for the set of 
integers, Z# for the set of nonnegative integers, E for the set of even integers, 
Q for the set of rational numbers, Q + for the set of positive rational numbers, 
Q* for the set of nonzero rational numbers, R for the set of real numbers, R + 
for the set of positive real numbers, R* for the set of nonzero real numbers, C 
for the set of complex numbers, and C* for the set of nonzero complex numbers. 


1.1 Sets 

We will not attempt to give an axiomatic treatment of set theory. Rather we 
use an intuitive approach to the subject. Consequently, we think of a set as 
some given collection of objects. A set S with only a finite number of elements 
is called a finite set; otherwise S is called an infinite set. We let \S\ denote 
the number of elements of S. We quite often denote a finite set by a listing of 
its elements within braces. For example, {1,2,3} is the set consisting of the 
objects 1,2,3. This technique is sometimes used for infinite sets. For instance, 
the set of positive integers N may be denoted by (1, 2,3,...}. 

Given a set S', we use the notation x 6 S and x £ S to mean a: is a member 
of S and x is not a member of S, respectively. For the set S = {1,2,3}, we 
have 1 C S and 4 £ S. 

A set A is said to be a subset of a set S if every element of A is an element 
of S. In this case, we write ACS and say that A is contained in S. If A C S, 
but 4^5, then we write A C S and say that A is properly contained in S or 
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that A is a proper subset of S. As an example, we have {1,2,3} C {1,2,3} 
and {1, 2} C {1,2, 3}. 

Let A and B be sets. If every member of A is a member of B and every 
member of B is a member of A, then we say that A and B are the same or 
equal. In this case, we write A — B. It is immediate that A = B if and only 
if A C B and B C A. Thus, we have the following theorem. 

Theorem 1-1.1 Let A and B be sets. Then A = B if and only if A C B and 

BCA.m 

The null set or empty set is the set with no elements. We usually denote 
the empty set by fi. For any set A, we have C A. The later inclusion follows 
vacuously. That is, every element of <f is an element of A since <fi has no 
elements. 

We also describe sets in the following manner. Given a set S , the notation 

A = {x | x E S', P(a;)} 
or 

A = {x e S 1 P(x)} 

means that A is the set of all elements x of S such that x satisfies the property 
P. For example, N — {x \ x E Z, x > 0}. 

We can combine sets in several ways. 

Definition 1.1.2 The union of two sets A a.nd B , written AuJB, is defined 
to be the set 

Au B — {x \ x € A or x € B}. 

In the above definition, we mean x is a member of A or x is a member of 
B or x is a member of both A and B. 

Definition 1.1.3 The intersection of two sets A and B, written An B, is 
defined to be the set 


AC\B — {x\x£A and x € B}. 


Here x is an element of A n B if and only if x is a member of A and at the 
same time x is a member of B. 

Let A and B be sets. By the definition of the union of sets, every element 
of A is an element of Au5. That is, A C A U B. Similarly, every element of 
B is also an element of A U B and so B C A U B. Also, by the definition of 
the intersection of sets, every element of A fl B is an element of A and also an 
element of B. Hence, AflB C A and An B C B. We record these results in 
the following theorem. 
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Theorem 1.1.4 Let A and B be sets. Then the following statements hold: 

(i) A C AU B and B C All B. 

(ii) A n B C A and A fl B C B. ■ 

The union and intersection of two sets A and B is described pictorially in 
the following diagrams. The shaded area represents the set in question. 


AUB AHB 

Two sets A and B are said to be disjoint if A D B — (p. 

Example 1.1.5 Let A be the set { 1 , 2 , 3 , 4 } and B be the set { 3 , 4 , 5 , 6 }. Then 

AUB = { 1 , 2 , 3 , 4 , 5 , 6 } 

and AOB = { 3 , 4 }. If C is the set { 5 , 6 }, then 

AuC = { 1 , 2 , 3 , 4 , 5 , 6 } 




while A fl C = (f. 

Now that the union and intersection have been defined for two sets, these 
operations can be similarly defined for any finite number of sets. That is, 
suppose that A\, A2, ..., A n are n sets. The union of A±, A2, ..., A n , denoted 
by yJf^Ai or A\ U A2 U • • ■ U A n , is the set of all elements x such that x is 
an element of some A z , where 1 < i < n. The intersection of A\, A2, ..., A n , 
denoted by C\f =1 Ai or A\ D A2 Pi • • • Pi A n , is the set of all elements x such that 
x £ Aj for allf, 1 < i < n. 

We say that a set I is an index set for a collection of sets A if for any 
a E /, there exists a set A a e A and A = {A a \ a £ I}. I can be any nonempty 
set, finite or infinite. 

The union of the sets A a , a £ I, is defined to be the set {x \ x £ A a for 
at least one a £ 1 } and is denoted by U a £iA a . The intersection of the sets 
A a , a £ I, is defined to be the set {x \ x £ A a for all a £ 1 } and is denoted 
by D ae/ A a . 


Definition 1.1.6 Given two sets A a,nd B , the relative complement of B 
in A, denoted by the set difference A\B , is the set 

A\B = {x | x £ A, but x ^ B}. 
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The following diagram describes the set difference of two sets. 



Example 1.1.7 Let A = {1,2, 3,4} and B = { 3,4, 5,6}. Then A\B = {1,2}. 

We now define a concept which is a building block for all of mathematics, 
namely, the concept of an ordered pair. 

Definition 1.1.8 Let A and B be nonempty sets and x £ A, y £ B. 

(i) The ordered pair (x,y) is defined to be the set {{x},{x,y}}. 

(ii) The Cartesian cross product (Cartesian product) of A and B, 
written A x B, is defined, to be the set 

Ax B — {(as,y) \ x £ A,y £ B}. 

Let (x,y), (z,w) £ A x B. We claim that (x,y) = ( z,w ) if and only if 
x — z and y = w. First suppose that x — z and y — w. Then {{a:}, {x,y}} = 
{{z}, {z,u>}} and so (x, y) = ( z , w). Now suppose that (x, y) = (z, w). Then 

{{x}, {x, y}} = {{z},{z,w}}. 

Since {x} € {{a:}, {x, y}}, it follows that {x} 6 {{^}, {z, r^}}. This implies that 
{x} - {z} or {x} = {z,w}. If {x} = {z}, then we must have {x,y} = {z,w}. 
From this, it follows that x — z and y = w. If {x} = {z,w}, then we must have 
{x,y} = {z}. This implies that x = z = w and x = y = z. Thus, in this case, 
x — y ~ z = w. This establishes our claim. 

It now follows that if A has m elements and B has n elements, then A x B 
has rrm elements. 

Example 1.1.9 Let A = {1,2,3} a,nd B = {3,4}. Then 

A x B = {(1,3), (1,4), (2,3), (2,4), (3, 3), (3,4)}. 

For the set R of real numbers, the Cartesian product R x R is m,erely the 
Euclidean plane. 

Definition 1.1.10 For any set X , the power set of X, written V(X), is de¬ 
fined to be the set {A \ A is a subset of X}. 
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Example 1.1.11 Let X = {1,2,3}. Then 

P(X) = {0,{1},{2},{3},{1,2},{1,3},{2,3},{1,2,3}}. 

Here V{X) has 2 3 elements. 

Remark 1.1.12 Let P and Q be statements. Throughout the text we will 
encounter questions in which we will be asked to show that P if and only if 
Q; that is, show that statement P is true if and only if statement Q is true. 
In situations like this, we first assume that statement P is true and show that 
statement Q is true. Then we assume that statement Q is true and show that 
statement P is true. The statement P if and only if Q is also equivalent to 
the statement: if P, then Q , and if Q, then P. For example, see Worked-Out 
Exercise 1, below. 

1.1.1 Worked-Out Exercises 

0 Exercise 1 Prove for sets A and B that A C B if and only if A U B = B. 

Solution: First suppose A C B. We now show that A U B = B. Let x be 
any element of A U B. Then either x £ A or x £ B. This implies that x £ B 
since A C B. Thus, we find that every element of A U B is an element of B and 
so A U B C B. Also, B C A U B by Theorem 1.1.4(i). Hence, A U B = B. 

Conversely, suppose AU B — B. Now by Theorem 1.1.4(i), A C AU B. 
Since A U B = B, it now follows that A C B. 

0 Exercise 2 For a subset A of a set S , let A' denote the subset S\A. A' is 
called the complement of A in S. Let A and B be subsets of S. Prove 
that (A D B)' = A' U B' , DeMorgan’s law. 

Solution: First we show that (A D B)' C A' U B' . Then we show that 
A' U B' C (A n B)'. The result then follows by Theorem 1.1.1. 

Let x be any element of (A D B )'. Now (A n B)' = S\(A D B) and so x £ S 
and x A fl B. Also, x £ A fl B implies that either x ^ A or x B. If x £ S 
and x £ A, then x £ A!, and if x £ S and x £ B, then x £ B'. Thus, either 
x £ A' or x £ B ', i.e., x £ A' U B '. Hence, (A fl B)' CAT B'. 

Let us now show that A' U B 1 C (An B)'. Suppose x is any element of 
A' U B'. Then either x £ A' or x £ B'. Suppose x £ A' , then x £ S and x £ A. 
Since A fl B C A and x £ A, we must have x An B. This implies that 
i E (An B)'. Similarly, we can show that if x £ B', then x £ A n B, i.e., 
ie(An B)'. Hence, A! U B' C (A n B)'. Consequently, (A n B)' = A' U B'. 

0 Exercise 3 Let A, B , and C be sets. Prove that 

An(BuC) = (AnB)u(AnC). 
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Solution: As in the previous exercise, we first show that A fl (B U C) C 
(i4n5)U(An C) and then (A fl B) U {A Pi C) C An(B UC). The result then 
follows by Theorem 1.1.1. 

Let x be any element of A n (B U C). Then x E A and x E B U C. Thus, 
x E A and x E B or x E C. If x E A and x E B, then x E A fl JB, and if 
x E A and then x E A fl C. Therefore, x^ADBotxgAdC. Hence, 

x E {AO B) U (An C). This shows that A fl (B U C) C (An B) U (An C). 

Let us now show that (A 0 B) LJ (A fl C) C A fl (B U C). Suppose x is any 
element of (AnB) U (AnC). Then x E AnB or x E AnC. Suppose x E AnB, 
then x E A and x E B. Since B C B U (7, we have i G 5 U C, Thus, x E A 
and i G BUC and soiGdn(BuC), Similarly, if x E A and x G C, then 
x E dfl (BUd). Hence, (A D 13) U (A D C) C A fl (B U C). Consequently, 
An(BuC) = (AnB)u(AnC). 

1.1.2 Exercises 

1. Let A = {i,y, z} and B = {y,ic}. Determine each of the following sets: 

AuB, AflB, A\B , B\A, A x B, and V(A). 

2. Prove for sets A and B that A C B if and only if An B = A. 

3. Prove for sets A, H, and C that 

(i) A U B = B U A and A fl B = B n A, 

(ii) (AUB)UC = AU(BUC) and (A fl B) D C = A D (H n C), 

(iii) Au(BnC) = (AuB)n(AU C), 

(iv) AU(AnB) = A, 

(v) A n (A U B) = A. 

4. If a set S' has 12 elements, how many elements does V(S) have? How 

many of these are properly contained in S? 

5. For subsets A and B of a set S, prove DeMorgan’s law: 

(A U B)' = A'n B'. 

6. The symmetric difference of two sets A and B is the set 

AAB = (AuB)\(AnB). 

(i) If A = {a, 6, c} and B — {6, c, d, e}, find A A IB. 

(ii) Show that A A B = (A\B) U (B\A). 
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7. Let A and B be finite subsets of a set S. Show that 

(i) if A fl B = </>, then \A U B\ — \A\ + |H| , 

(ii) |4\B| = \A\ -\AC\B\, 

(iii) \A U B\ = |A| + |B| - \A n B\. 

8. In each of the following exercises, write the proof if the statement is true; 
otherwise give a counterexample. The sets A, B, and C are subsets of a 
set U. 

(i) A n (B\C) = (A n B)\(A n C). 

(ii) A\(B U C) = ( A\B ) U C. 

(iii) (A\B)' = (B\A) r . 

(iv) A x (B U C) = (A x B) U (A x C). 

(v) A A C = B A C implies A = B. 

1.2 Integers 

Throughout abstract algebra, the set of integers provides a source of examples. 
In fact, many algebraic abstractions come from the integers. An axiomatic 
development of the integers is not given in this text. Instead, certain basic 
properties of integers are taken for granted. For example, if n and m are 
integers with n < m, then there exists a positive integer t G Z such that 
m = n + t. In this section, we review and prove some important properties of 
the integers. 

The proofs of many results of algebra depend on the following basic principle 
of the integers. 

Principle of Well-Ordering: Every nonempty subset of Z^ has a smallest 
(least) element, i.e., if 0 ^ S C Z#, then there exists x G S such that 
x < y for all y 6 S. 

Let S be a subset of Z#. Suppose that S ha s the following properties: 

(i) no € 5, i.e., there exists an element no G S. 

(ii) For all n > no, n 6 Zif n € S, then n + 1 G S. 

We show that the set of all integers greater than or equal to no is a subset 
of S y i.e., 

{n G Z # | n > no} C S. 

Let T denote the set {n G Z * | n > n o}. We wish to show that T C S. On 
the contrary, suppose T S. Then there exists a G T such that a <£ S. Let T\ 
be the set of all elements of T that are not in S, i.e., Ti = T\S. Since a el 
and a ^ S', we have a G Tj. Thus, T\ is a nonempty subset of Z # . Hence, by the 
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principle of well-ordering, T\ has a smallest element m, say. Then m G T and 
m ^ S. Since m G T, m > uq. If m, — no, then m. G S', a contradiction. Thus, 
m > uq. This implies that m — 1 > no and so m — 1 G T. Now m — 1 ^ Ti since 
m is the smallest element of T}. Since m — 1 G T and m — 1 ^ Ti, we must have 
m - 1 G 5. But then by (ii), m = (m — 1) + 1 G 5, which is a contradiction. 
Hence, T C S'. 

Thus, from the principle of well-ordering, we deduce another important 
property of integers. This property is known as the principle of mathematical 
induction. We thus have the following theorem. 


Theorem 1.2.1 (Principle of Mathematical Induction) LetS C Z*. Let 
no G S. Suppose S satisfies either of the following conditions. 

(i) For all n > no, n G Z#, if n G S, then n + 1 G S. 

(ii) For all m < n, n G Z 7 ^, if m E S , then n G S’. 

Then 

{n G Z 7 ^ | n > no} C S’. ■ 


We proved, above, Theorem 1.2.1, when S satisfies (i). We leave it for the 
reader to prove Theorem 1.2.1 if S satisfies (ii). 

We have seen the following mathematical statement in a college algebra or 
in a calculus course. 


1 + 2 +-h n 


n(n + 1) 
2 


n > 1. 


We now show how this statement can be proved using the principle of 
mathematical induction. Let S’(n) denote the above mathematical statement, 
i.e., 

S(n) : 1 + 2 + ■•■ + »= n ( n f ^ , n> 1. 

This statement will be true if the left-hand side of the statement is equal to 
the right-hand side. Let 


S = {n G Z # | S(n ) is true}. 


That is, S is the set of all nonnegative integers n for which the statement S(n ) 
is true. We will show that S is the set of all positive integers. Now 

, _ i-q + i) 

2 

i.e., S( 1) is true. Hence, 1 G S. Let n be an integer such that n > 1 and 
suppose S(n) is true, i.e., n G S. We now show that S(n + 1) is true. Now 


S(n + 1) : 


1 + 2 +-h n + (n + 1) = 


(n + l)(n + 2) 
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Consider the left-hand side. 

l + 2 + -- -+ n+(n + l) = vTh+H -f (n + 1) (since S(n ) is true) 

(n+l)(n+2) 

~ 2 

Hence, the left-hand side is equal to the right-hand side and so S(n + 1) is 
true. Thus, n -1-1 E S. Hence, by the principle of mathematical induction, 
5 = {ne Z# | n > 1}. This proves our claim, which in turn shows that 


1 + 2 + • - • + n 


n(n + 1) 
2 


is true for all positive integers n. 


Sometimes we use the word induction for the principle of mathematical in¬ 
duction. 


A proof by the principle of mathematical induction consists of three steps. 
Step 1 : Show that no E S, i.e., the statement S{n o) is true for some 
no E Z # . 

Step 2 : Write the induction hypothesis: n is an integer such that n > no 
and n E S, i.e., S(n) is true for some integer n such that n > ng (or k is an 
integer such that ng < k < n and S(k) is true). 

Step 3: Show that n + 1 E 5, i.e., S(n + 1) is true. 

Example 1.2.2 In this example, we show that 2n + 1 < 2 n for all n > 3. 

Let S{n) be the statement: 

S(n ) : 2n + 1 < 2 n , n > 3. 

Since we want to show that S(n ) is true for all n > 3, as the first step of our 
induction, we m,ust verify that S( 3) is true. Let n = 3. Now 2n + l = 2-3+1 = 7 
and 2 n — 2 3 = 8. Thus, for n = 3, 2n + 1 < 2 n . This shows that S( 3) is true. 
Suppose that 2n + 1 < 2 n for some n > 3, i.e., Sin ) is true for some n > 3. 
Consider S{n + 1), 

S(n+1) : 2(n + 1) + 1 < 2 n+1 . 

Let us evaluate the left-hand side of S(n + 1). We have 
2 (n + 1) + 1 — 

< 

< 

Thus, S(n + 1) is true. Hence, by the principle of mathematical induction, 
2n + 1 < 2 n for all n > 3. 

“Pi mJuc. 7feaiAe»tattea/ 
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The principle of mathematical induction is a very useful tool in mathemat¬ 
ics. We will make use of this result throughout the text. 

We now prove the following important properties of integers with the help 
of the principle of well-ordering. 

Theorem 1.2.3 (Division Algorithm) Let x, y £ Z with y ^ 0. Then there 
exist unique integers q and r such tha,t x = qy + r, 0 < r < \y\ . 

Proof. Let us first assume y > 0. Then y > 1. Consider the set 

S = {x — uy | u £ Z ,x — uy > 0}. 

Since y > 1, we have x — (— \x\)y — x + |x| y > 0 so that x — (— \x\)y £ S. 
Thus, S is a nonempty set of nonnegative integers. Hence, by the principle 
of well-ordering, S must have a smallest element, say, r. Since r £ S, we have 
r > 0 and r = x — qy for some q £ Z. Then x = qy + r. We must show that 
r < \y\ . Suppose on the contrary that r > |t/| — y. Then 

x - (q + T)y = {x - qy) - y = r - y > 0 

so that r — y £ S, a contradiction since r is the smallest nonnegative integer 
in S and r — y < r. Hence, it must be the case that r < |y|. This proves the 
theorem in case y > 0. 

Suppose now that y < 0. Then \y\ >0. Thus, there exist integers q' , r such 
that x = q' \y\ + r, 0 < r < \y\ by the above argument. Since y < 0, |y| = -y. 
Hence, x — — q'y + r. Let q = ~q r . Then x = qy + r, 0 < r < \y\ , the desired 
conclusion. 

The uniqueness of q and r remains to be shown. Suppose there are integers 
</, r' such that 

x = qy + r = q'y + r', 

0 < r' < \y\, 0 < r < \y\ . Then 

r' - r = (q- q)y. 


Thus, 


\r -r =\q-q\\y\. 


Now — \y\ < — r < 0 and 0 < r' < \y\. Therefore, if we add these inequalities, 
we obtain 

- \y\ < r' -r < \y \, 


or | r' — r | < \y\. Hence, we have 

0 < \q — (/] < 1. 


XI < L - 
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Since q — q' is an integer, we must have 0 = \q — q'\ . It now also follows that 
\r — r'\ = 0. Thus, q — q' = 0 and r — r' = 0 or q = q' and r — r’. Consequently, 
q and r are unique. ■ 

In Theorem 1.2.3, the integer q is called the quotient of x and y on dividing 
x by y and the integer r is called the remainder of x and y on dividing x by 

y- 

The following corollary is a special case of Theorem 1.2.3. 

Corollary 1.2.4 For any two integers x and y with y > 0, there exist unique 
integers q and r such that x = qy + r, where 0 < r < y. 

Proof. By Theorem 1.2.3, there exist unique integers q and r such that 
x = qy + r, where 0 < r < |?/|. Since y > 0, \y\ = y. Hence, x = qy + r, where 
0<r<y.M 

Definition 1.2.5 Let x, y E Z with x ^ 0. Then x is said to divide y or x is 
a divisor (or factor) of y, written x\y, provided there exists q E Z such that 
y = qx. When x does not divide y , we sometimes write x / y. 

Let x, y , z be integers with x ^ 0. Suppose x\y and x\z. Then for all integers 
s and t, x\(sy + tz). We ask the reader to prove this fact in Exercise 5(iii) (page 
19 ). 

Definition 1.2.6 Let x, y E Z. A nonzero integer c is called a common 
divisor of x and y if c\x and c\y. 

Definition 1.2.7 A nonzero integer d is called a greatest common divisor 
(gcd) of the integers x and y if 

(i) d\x and d\y, 

(ii) for all c E Z if c\x and c\y, then c\d. 

Let d and d! be two greatest common divisors of integers x and y. Then d\d' 
and d'\d. Hence, there exist integers u and v such that d' = du and d = d'v. 
Therefore, d = duv , which implies that uv — 1 since d / 0. Thus, either 
u = v = 1otu = v = — 1. Hence, d' — Ed. It now follows that two different 
gcd’s of x and y differ in their sign. Of the two gcd’s of x and y , the positive 
one is denoted by gcd(rr,?/). For example, 2 and —2 are the greatest common 
divisors of 4 and 6. Hence, 2 = gcd(4,6). 

In the next theorem, we show that the gcd always exists for any two nonzero 
integers. 

Theorem 1.2.8 Let x, y € Z with either x ^ 0 or y ^ 0. Then x and y have 
a positive greatest common divisor d. Moreover, there exist elements s, t G Z 
such that d = sx + ty. 
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Proof. Let 

S — {mi + ny \ m, n 6 Z, mx + ny > 0}. 

Suppose x 7 ^ 0. Then 

x if x > 0 

—x if x < 0 

lx + Oy if x > 0 

(—l)x + 0 y if x < 0 . 

Hence, |x| € S and so 5 / By the well-ordering principle, S contains a 
smallest positive integer, say, d. We now show that d is the greatest common 
divisor of x and y. 

Since d E S, there exist s, t 6 Z such that d = sx + ty. First we show that 
d\x and d\y. Since d ^ 0, by the division algorithm (Theorem 1.2.3), there exist 
integers q and r such that 

x = dq 4 - r, 

where 0 < r < |d| = d. Thus, 
r — x — dq 

= x — (sx -I- ty)q (substituting for d) 

= (1 -qs)x + (-qt)y. 

Suppose r > 0. Then r € S, which is a contradiction since d is the smallest 
element of S and r < d. Thus, r = 0. This implies that x = dq and so d\x. 
Similarly, d\y. Hence, d satisfies (i) of Definition 1.2.7. Suppose c|x and c\y for 
some integer c. Then c|(sx + ty) by Exercise 5(iii) (page 19), i.e., c\d. Thus, d 
satisfies (ii) of Definition 1.2.7. Consequently, d = ged (x,y). ■ 

Let x and y be nonzero integers. By Theorem 1.2.8, gcd(x,?/) exists and 
if d = ged (x,y), then there exist integers s and t such that d = sx 4 - ty. The 
integers s and t in the representation d = sx + ty are not unique. For example, 
let x = 45 and y — 126. Then gcd(x,y) = 9, and 9 = 3 ■ 45 + (—1) • 126 — 
129 • 45 + (-46) • 126. 

The proof of Theorem 1.2.8 does not indicate how to find gcd(x, y) or the 
integers s, t. In the following, we indicate how these integers can be found. 

Let x, ?/ G Z with y ^ 0. By the division algorithm, there exist q\,r\ e Z 
such that 

x = qiy + r u 0 < n < \y\. 

If 7~i 7 ^ 0, then by the division algorithm, there exist 92 , ^2 € Z such that 

0 < r 2 < n. 



y = g2n + r 2 , 
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If r 2 7 ^ 0, then again by the division algorithm, there exist < 73 , r 3 € Z such 
that 

r\ = q^r 2 + r 3 , 0 < r 3 < r 2 . 

Since r\ > r 2 > r 3 > 0, we must in a finite number of steps find integers 
q n , < 7 n +i, and r n > 0 such that 

'T'n —2 = qn.T'n— 1 d“ I'm 0 <C 7+ <C 

7"n—1, — Qn+l^n 0. 

We assert that r n (the last nonzero remainder) is the greatest common 
divisor of x and y. Now r n \r n ~i. Since r n \r n , r n |r n _!, and r n _ 2 = q n r n -i + r n , 
we have r n |r n _ 2 by Exercise 5(iii) (page 19). Working our way back in this 
fashion, we have r n \r\ and r n |r 2 . Thus, r n \y since y = q 2 r\ + r 2 . Since r n \y, 
r n |ri, and x = q±y + r\, we have r n \x. Hence, r n is a common divisor of x and 
y. Now if c is any common divisor of x and y, then we see that c\ri. Since c\y 
and c|ri, c\r 2 . Continuing, we finally obtain c\r n . Thus, r n = gcd (x,y). 

We now find s,tgZ such that gcd(x, y) — sx + ty as follows: 

r n = r n _ 2 + r n ^i(-q n ) 

— ^n —2 “I - [^n—3 r n— 2 ( qn— 1 )]( Qn) 

= r n -${-q n ) + r n _ 2 (l + q n -\q n ) (simplifying). 

We now substitute r n _4 +r„_ 3 (—g„_ 2 ) for r n _ 2 . We repeat this “back” substi¬ 
tution process until we reach r n = sx + ty for some integers s and t. 

We illustrate the above procedure for finding the gcd and integers s and t 
with the help of the following example. 

Example 1.2.9 Consider the integers 45 and 126. Now 

126 = 2-45 + 36 
45 = 1-36 + 9 
36 = 4-9 + 0 

Thus, 9 = gcd(45,126). Also, 

9 = 45-1-36 

= 45-1-[126 -2-45] 

= 3-45 +(-1) • 126. 


Here s = 3 and t = — 1. 

We now define prime integers and study their basic properties. 

Definition 1.2.10 (i) An integer p > 1 is called prime if the only divisors of 
p are ±1 and ±p. 

fii) Two integers x and y are called relatively prime if gcd(x.y) = 1. 

TWl JPL^tU^a.tla.a.L 
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The following theorem gives a necessary and sufficient condition for two 
nonzero integers to be relatively prime. 

Theorem 1.2.11 Let x andy be nonzero integers. Thenx cmdy are relatively 
prime if and only if there exist s,t £ Z such that 1 = sx + ty. 

Proof. Let x and y be relatively prime. Then gcd(x,j/) — 1. By Theorem 
1.2.8, there exist integers s and t such that 1 = sx + ty. 

Conversely, suppose 1 = sx + ty for some pair of integers s, t. Let d = 
gcd(x, y). Then d\x and d\y and so d\(sx + ty ) (by Exercise 5(iii) (page 19)) or 
djl. Since d is a positive integer and d\l, d = 1. Thus, gcd(;r,y) = 1 and so x 
and y are relatively prime. ■ 

Theorem 1.2.12 Let x, y, z £ Z with r / 0. If x\yz and x, y are relatively 
prime, then x\z. 

Proof. Since x and y are relatively prime, there exist s,f G Z such that 
1 = sx+ty by Theorem 1.2.11. Thus, z = sxz+tyz. Now x\x and by hypothesis 
x\yz. Thus, x\{sxz + tyz) by Exercise 5(iii) (page 19) and so x\z. ■ 

Corollary 1.2.13 Let x,y,p £ Z with p a prime. If p\xy, then either p\x or 

p\y- 

Proof. If p\x , then we have the desired result. Suppose that p does not 
divide x. Since the only positive divisors of p are 1 and p , we must have that 
p and x are relatively prime. Thus, p\y by Theorem 1.2.12. ■ 

The following corollary is a generalization of Corollary 1.2.13. 

Corollary 1.2.14 Let xi,X 2 ,..., x n ,p £ Z with p a prime. If 

P 1^12^2 " ’ " , 


then p\xi for som,e i , 1 < i < n. 

Proof. The proof follows by Corollary 1.2.13 and induction. ■ 

Consider the integer 24. We can write 24 = 2 3 • 3. That is, 24 can be 
written as product of prime powers. Similarly, 49500 = 2 2 • 3 2 ■ 5 3 • 11. In the 
next theorem, called the fundamental theorem of arithmetic, we prove that any 
positive integer can be written as product of prime powers. 

“Public. 7^a.£/La#>ta£lea./ ~Pt±y-£-LG-A. 
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Theorem 1.2.15 (Fundamental Theorem of Arithmetic) Any integer n 
> 1 has a unique factorization (up to order) 

n — V\V e <i ■ - - P e s s , ( 1 . 1 ) 


where pi,P 2 , ■ ■ ■, p s are- distinct primes and e\, e 2 ,..., e s are positive integers. 


Proof. First we show that any integer n > 1 has a factorization like Eq. 
(1.1) and then we show the uniqueness of the factorization. 

We show the existence of the factorization by induction. If n = 2, then 
clearly n has the above factorization as a product of prime powers. Make the 
induction hypothesis that any integer k such that 2 < k < n has a factorization 
like Eq. (1.1). If n is prime, then n already has the above factorization as a 
product of prime powers, namely n itself. If n is not prime, then n = xy for 
integers x,y, with 1 < x < n and 1 < y < n. By the induction hypothesis, 
there exist primes qi, q 2 ,. .., qk, q[, q' 2 ,. ■ ■, q't and positive integers ei, e 2 , . .., 
e k , e'j, e^,.. ., e' t such that qi,q 2 ,..., qk are distinct primes, q[ , q 2 ,. .., q[ are 
distinct primes and 



Thus, 


ei 69 6 u 

n = qiq 2 ---q k qiQ2" m Qt- 


i. e., n can be factored as a product of prime powers. If qi = q'■ for some i and 

/g ( g ■ -|- g ( 

j, then we replace q^qj j by qd 3 . It now follows that n = p e fp e 2 ■ ■ • p ^ s , where 
Php2, ■ • •, Ps are distinct primes and ei, e 2 ,..., e s are positive integers. Hence, 
by induction, any integer n > 1 has a factorization like (1.1). 

We now prove the uniqueness property by induction also. If n = 2, then 
clearly n has a unique factorization as a product of prime powers. Suppose the 
uniqueness property holds for all integers k such that 2 < k < n. Let 


n = p^pl 2 ■ ■ -pl s = q^ql 2 ■ • ■ q? 


( 1 . 2 ) 


be two factorizations of n into a product of prime powers. Suppose n is prime. 
Then in Eq. (1.2), we must have s = t = 1 and e\ = 1 = c\ since the only 
positive divisors of n are 1 and n itself. This implies that n = p\ = q\ and so 
the factorization is unique. 

Suppose n is not a prime. Now p\\n and 


n 


Pi 


ei—1 69 

= Pi Pi • • ■ P 


e s 

5 


is an integer. If s = 1, then n = p ^ and since n is not a prime, we have e\ > 1. 

Hence, T- = p e P~ l > 2. If s > 1, then — = Pi 1-1 Pp 2 ■ • -pt 3 > 2. Thus, in either 

1 miziAcMiaiLca/ 
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case, is an integer > 2. Now pi|n implies that pilq^q^ 2 ‘ ‘ ‘ q? anc ^ so by 
Corollary 1.2.14, p\\q*? for some i. By reordering the g* if necessary, we can 
assume that i — 1. Thus, pil^ 1 and so by Corollary 1.2.14, pi\qi- Since p\ and 
qi are primes, p\ = q\. Thus, 

-=pV~ 1 p1 2 ---pV =P? _1 92 2 (1-3) 

Pi 

Now e\ — 1 = 0 if and only if c\ — 1 = 0. For suppose e\ — 1 = 0 and c\ — 1 > 0. 
Then — = p • • • p e s s implies that p\ / — and -r- = p < \~ l q < 2 " ‘ (ft implies that 

P 1 P 1 .Pi 

which is of course impossible. We can get a similar contradiction if we 
assume e\ — 1 > 0 and ci — 1 = 0. 

Now ^ is an integer and 2 < ^ < n. Hence, by the induction hypothesis, 
we obtain from Eq. (1.3) that s — t, and p\ = gi,..., p s = q s (without 
worrying about the order), and e\ — 1 = C\ — 1, e 2 = C 2 ,..., e s = c s . Hence, 
by induction, we have the desired uniqueness property. ■ 


Corollary 1.2.16 Any integer n < —1 has a unique factorization (up to order) 

n= (-1 )pTp £ 2 -‘'P e s s , 


where pi,p2> • • • iPs are distinct primes and ei, e 2 ,..., e s are positive integers. 


Proof. Since n < —1, —n > 1. Hence, by Theorem 1.2.15, —n has a unique 
factorization (up to order) 


where pi,p 2 , • • • ,p s ar e distinct primes and e\, e 2 ,..., e s are positive integers. 
Thus, 

n = {-l)p\ l p% ■■■P e s s , 

where p\,... ,p s are distinct primes and ej,... ,e s are positive integers. ■ 

Theorem 1.2.15 says that any positive integer greater than 1 can be written 
as a product of prime powers. Now we pose the obvious question: How many 
prime numbers are there? This is answered by the following theorem due to 
Euclid. 


Theorem 1.2.17 (Euclid) There are an infinite num.ber of primes. 

~Pu*uc- 
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Proof. Let P\,P2, • ■ • ,p n be a finite number of distinct primes. Set x = 
P\P 2 • • -pn + 1- Since pi does not divide 1, pi does not divide x, i = 1, 2,..., n. 
By the fundamental theorem of arithmetic, it follows that there is some prime 
p such that p\x. Thus, p is distinct from Pi,P2, ■ ■ • ,Pn so that we have n + 1 
distinct primes. That is, for any finite set of primes we can always find one 
more. Thus, there must be an infinite number of primes. ■ 

We close this section with the following definition. There are a few places 
in the text where we will be making use of it. 

Definition 1.2.18 Let n be a positive integer. Let (j)(n ) denote the num,ber of 
positive integers m such that m<n and gcd (m,n) = 1, i.e., 

4>(n) = |{m £ N | m < n and gcd(m,n) — 1} |. 

0(n) is called the Euler <f>-function. 

Clearly 0(2) = 1, 0(3) = 2, 0(4) = 2. Since 1, 5, 7, 11 are the only positive 
integers less than 12 and relatively prime to 12, 0(12) = 4. 

Let {<ii,..., a n } C Z. We use the notation X^X=i a i to denote the sum of 
Q-i, . . . , i.e., 

n 

T: di = cli + • * ■ + a n . 

2=1 

If S is any finite subset of Z, then a denotes the sum of all elements of 

S. For example, if S = {2 ,4,7}, then a = 2 + 4 + 7 = 13. 

1.2.1 Worked-Out Exercises 

0 Exercise 1 By the principle of mathematical induction, prove that 

3 2 n+l _|_ ^—!) n 2 = 0 ( mO d 5) 

for all positive integers n. (For integers a and b, a = b (mod 5) means 5 
divides a — b .) 

Solution: Let S(n) be the statement 

S(n) : 3 2n+1 + (—l) n 2 = 0(mod 5), n > 1. 

We wish to show that S(n) is true for all positive integers. We first must verify 
that S( 1) is true as the first step of our induction. Let n = 1. Then 

3 2 n+i + (_!)n 2 = 3 2 +i + = 27 - 2 = 25 = 0(mod 5). 
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Thus, S( 1) is true. Now suppose that S(n) is true for some positive integer n, 
i.e., 3 2n+1 -f (—l) n 2 = 0(mod 5) for some integer n > 1. We now show that 

S(n + 1) : 3 2 (n+i)+i + = 0(mod 5) 

is true. Now 

3 2(n+l)+i + (_ 1 )n+i 2 = 3 2 «+l . 32 _ (_ 1 )n 2 

= 9(3 2n+1 + (—l) n 2) — (—l) n 18 — (—l) n 2 
= 9(3 2n+1 + (—l) n 2) — (—l) n 20. 

Since 3 2n+1 + (—l) n 2 = 0(mod 5) and 20 = 0(mod 5), it follows that 3 2 ( n + 1 )+ 1 -|_ 
(—l) n+1 2 = 0(mod 5). This shows that S{n+ 1) is true. Hence, by the principle 
of mathematical induction, 3 2n+1 + (—l) n 2 = 0(mod 5) for all positive integers 
n. 

<0 Exercise 2 Let a and b be integers such that gcd(a,4) = 2 and gcd(6,4) = 
2. Prove that gcd(a + b, 4) = 4. 

Solution: Since gcd(a,4) = 2, 2|a, but 4 does not divide a. Therefore, 
a = 2x for some integer x such that gcd(2,;c) = 1. Similarly, b — 2y for some 
integer y such that gcd(2,y) = 1. Thus, x and y are both odd integers. This 
implies that ar-f- y is an even integer and so x + y = 2n for some integer n. Now 
a T b — 2(x + y) — 4n. Hence, gcd(a + 6,4) = gcd(4n, 4) = 4. 

0 Exercise 3 Let a, 6, and c be integers such that gcd(a,c) = gcd(6,c) = 1. 
Prove that gcd(a6, c) = 1. 

Solution: If c = 0, then gcd(a, 0) — gcd(6,0) = 1 implies that a = ±1 and 
b = ±1. Thus, gcd(a6,c) = gcd(±l,0) = 1. Suppose now c ^ 0. By Theorem 
1.2.8, gcd(a6, c) exists. Let d = gcd(a6, c). Also, by Theorem 1.2.8, there exist 
integers xi,y\,X2,y2 such that 1 — azi+q/i, 1 = bx 2 + cy 2 . Thus, (ax\)(bx 2 ) = 
(l-cyi)(l-cy 2 ) = l-cyi-cy 2 +cyicy 2 . Hence, 1 = (ab)x 1 x 2 +c(yi+y 2 -cyiy 2 )■ 
Thus, any common divisor of ab and c is also a divisor of 1. Hence, d\\. Since 
d > 0, d = 1. 


Exercise 4 Let a, b 6 Z with either a ^ 0 or b ^ 0. Prove that for any integer 


gcd(a, 6) = gcd(a, — b ) = gcd(a, b + ac ). 


Solution: Suppose a / 0. Then gcd(a, 6), gcd(a, — b) and gcd(a, b + ac) 
exist. Let d = gcd(a, 6). Then there exist integers x and y such that d = 
ax + by = ax + (—&)(—y). Thus, any common divisor of a and —6 is also a 
divisor of d. Hence, gcd(u, — 6)|d. Similarly, d|gcd(a, —6). Since gcd(a, b) and 


gcd(a, — b) are positive, gcd(a,6) = gcd (a.—b) 
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Let e = gcd(a, b + ac). Then there exist integers p and q such that e = 
ap + (b + ac)q = ap + bq + acg = a(p + cq ) + bq. Since d\a and d\b, d\e. Also, 
d = ax + by — ax + (b + ac)y — acy — a(x — cy ) + (b + ac)y. Since e| a and 
e\b + ac, e\d. Hence, e = d. 

0 Exercise 5 Find integers x and y such that 512a; + 320 y = 64. 

Solution: 

512 - 320 • 1 + 192 

320 - 192 • 1 + 128 

192 = 128-1 + 64 
128 = 64 • 2 + 0. 

Thus, 64 - 192 - 128 - 192 - (320 - 192) = 192 • 2 + 320 • (-1) = (512 - 320) • 

2 + 320 ■ (-1) = 512 • 2 + 320 • (-3). Hence, x = 2 and y = -3. 

1.2.2 Exercises 

1 . Determine gcd(90,252). Find integers s and t such that 

gcd(90, 252) = s ■ 90+ t -252. 


2. Find integers s and t such that gcd(963,652) = s ■ 963 + t ■ 652. 

3. Find integers s and t such that 657s + 963t — 9. 


4. Use the principle of mathematical induction to prove the following. 

(i) l 2 + 2 2 + 3 2 + • • ■ + n 2 = "("+ 1 K 2 "+ 1 ) , n = 1 ,2 ,.... 

(ii) 7 n — 1 is divisible by 6 for all n 6 Z#. 

(iii) 6 ■ 7 n — 2 ■ 3 n is divisible by 4 for all n € Z # . 

(iv) 5 2n + 3 is divisible by 4 for all n G Z#. 

(v) n < 2 n for all n G Z^. 

(vi) 2 n > n 2 , n = 4, 5,.... 

(vii) n! > 3 n , n = 7, 8 ,.... 


5. Let a, 6 , and c be three integers such that a ^ 0. Prove the following: 

(i) If a\b, then a| 6 c for all c G Z. 

(ii) If b 7 ^ 0, a |6 and b\c, then a|c. 

(iii) If a\b and a|c, then a\(bx + cy) for all G Z. 

(iv) If a, b are positive integers such that a\b, then a < 6 . 

(v) If b ^ 0, a| 6 , and 6 |a, then a = ± 6 . 

“Piifix*. 
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6. Let a, 6, and c be integers. Prove that if ac / 0 and ac|6c, then a\b. 

7. Let a, b, c, and d be integers such that a / 0 and 6/0. Prove that if a[c 
and b\d, then a6|cd. 

8. Let p be a prime integer, m, n integers and r a positive integer. Suppose 
p r \mn and p /m. Show that p r \n. 

9. Let a and 6 be integers and gcd(a, 6) = d. If a = dm and 6 = dn , prove 
that gcd(m, n) — 1. 

10. Let a, 6, and c be positive integers. Prove that gcd(a6, ac) = agcd(6, c). 

11. Prove that if gcd(x, y) — gcd(x, z) — 1, then gcd(x, yz) = 1 for all x,y,z E 

N. 

12. Prove that if gcd (x,y) = l,x| z, and y\z, then xy\z for all x,y,z € N. 

13. Let o, 6 E N. Show that gcd(o, 6) = gcd(a, a + 6). 

14. Prove that gcd(a, 6) = 1 for any two positive consecutive integers a and 
6 . 


15. Let x and y be nonzero integers. The least common multiple of x and 
y , written lcm(x,y), is defined to be a positive integer m such that 

(i) x\m and y\m and 

(ii) if x\c and j/|c, then m|c. 

Prove that lcm(x, y) exists and is unique. 

16. Let x and y be nonzero integers. Prove that lcm(x,y) • gcd(x,j/) = \xy\. 

17. Let x and y be nonzero integers. Show that lcm(x,y) = \xy\ if and only 
if gcd (x,y) = 1. 

18. Show that there are infinitely many prime integers of the form 6n — 1, 
n > 1. 


19. 


Let S' be a set with n elements, n > 1. Show by mathematical induction 
that |P(S)| = 2 n . 


20. Determine whether the following assertions are true or false. If true, then 
prove it, and if false give a counterexample. 

(i) If p is a prime such that p\a 5 , then p\a, where a is an integer. 

(ii) If p is a prime such that p|(a 2 + 6 2 ) and p|a, then p\b, where a and 6 
are integers. 

“Pufuc. 7feoiAe»ta^Lca/ 
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(iii) For any integer a, gcd(a, a + 3) = 1 or 3. 

(iv) If gcd(a,6) = 3 and gcd(6,6) = 3, then gcd(a + 6,6) = 6, where a 
and 6 are integers. 

(v) If gcd(6, c) = 1 and a|6, then gcd(a,c) = 1. 


1.3 Relations 


Some describe or define mathematics as the study of relations. Since a relation 
is a set of ordered pairs, we get our first glimpse of the fundamental importance 
of the concept of an ordered pair. 

Definition 1.3.1 A binary relation or simply a relation R from, a set A 
into a set B is a subset of A x B. 

Let R be a relation from a set A into a set B. If (x,y) G R, we write xRy 
or R(x) = y. If xRy, then sometimes we say that x is related to y (or y is in 
relation with x) with respect to R or simply x is related to y. If A = B, then 
we speak of a binary relation on A. 

Example 1.3.2 Let A denote the nam.es of all states in the USA and B = Z. 
With each state a in A associate a.n integer n which denotes the number of 
people in that state in the year 1996. Then R — {( a,n ) | a G A and n is the 
number of people in state a in 1996} is a. subset of A x Z. Thus, R defines a. 
relation from A into Z. 

Example 1.3.3 Consider the set of integers Z. Let R be the set of all ordered 
pairs (m,n) of integers such that m < n, i.e., 

R = {(m, n) G Z x Z | m < n}. 

Then R is a.binary relation on Z. 


Let R be a relation from a set A into a set B. By looking at the elements 
of R , we can find out which elements of A are related to elements of B with 
respect to R. The elements of A that are related to elements of B form a subset 
of A, called the domain of R, and the elements of B that are in relation with 
elements of A form a subset of B , called the range of R. More formally, we 
have the following definition. 


Definition 1.3.4 Let R be a relation from, a set A into a set B. Then the 
domain of R, denoted by V(R), is defined to be the set 

{x | x G A and there exists y G B such that (x,y) G R}. 

The range or image of R, denoted byT(R), is defined to be the set 

{y | y G B and, there exists x G A such that ( x,y ) G i?}. 
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Example 1.3.5 Let A = {4,5, 7, 8, 9} and B — {16,18,20,22}. Define R C 
Ax B by 

R = {(4,16), (4,20), (5,20), (8,16), (9,18)}. 

Then R is a relation from, A into B. Here (a, b) £ R if and only if a 
divides 6, where a £ A and b £ B. Note that for the domain of R, we have 
V(R) = {4,5,8,9} and for the range of R, we have T(R) = {16,18,20}. 

Example 1.3.6 Let S — {( x,y ) | x, y £ R, x 1 -1- y 1 — 1, y > 0}. Then S is a 
binary relation on R. S is the set of points in the Euclidean plane constituting 
the semicircle lying above the x-a,xis with center (0,0) and radius 1. 

Definition 1.3.7 Let R be a binary relation on a set A. Then R is called 

(i) reflexive if for all a; £ A, xRx, 

(ii) symmetric if for all x,y £ A, xRy implies yRx , 

(in) transitive if for all x,y,z £ A, xRy and yRz imply xRz. 

Definition 1.3.8 A binary relation E on a set A is called an equivalence 
relation on A if E is reflexive, symmetric, and transitive. 

The important concept of an equivalence relation is due to Gauss. We will 
use this concept repeatedly throughout the text. 

Example 1.3.9 Let A = { 1,2,3,4,5, 6} and E = {( 1,1), (2,2), (3,3), (4,4), 

(5,5), (6,6), (2,3), (3,2)}. Then E is an equivalence relation on A. 

Example 1.3.10 (i) Let L denote the set of all straight lines in the Euclidean 
plane and E be the relation on L defined by for all I 1 J 2 € L , (l\, I 2 ) £ E if and 
only if l\ and I 2 are parallel. Then E is an equivalence relation on L. 

(ii) Let L be defined as in (i) and P be the relation defined on L by for all 
l\fl2 £ L, (l\, If) £ P if and only ifCand I 2 are perpendicular. Let l be a line in 
L. Since l cannot be perpendicular to itself, (1,1) ^ P. Hence, P is not reflexive 
and so P is not an equivalence relation on L. Also, P is not transitive. 

Example 1.3.11 Let n be a fixed positive integer in Z. Define the relation = n * 
on Z by for all x,y £ Z, x = n y if and only if n\(x — y), i.e., x — y = nk for 
som.e k £ Z. We now show that = n is an equivalence relaiion on Z. 

(i) For all x £ Z, x — x = 0 = On. Hence, for all x £ Z, x = n x. Thus, = n 

is reflexive. 

(ii) Let x,y £ Z. Suppose x = n y. Then there exists q £ Z such that 

qn — x — y. Thus, (-q)n = y — x and so n\(y — x), i.e., y = n x. Hence, = n is 

symmetric. 

(Hi) Let x,y,z £ Z. Suppose x = n y and y = n z. Then there exist q, r £ Z 
such that qn = x — y and rn — y — z. Thus, (q + r)n = x — z and q + r £ Z. 

This implies that x = n z. Hence, = n is transitive. 

Consequently, = n is an equivalence xelation a©*Z. _ . —. 
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The equivalence relation, = n , as defined in Example 1.3.11 is called congru¬ 
ence modulo n. (Another commonly used notation for x = n y is x = y(mod 

n)>) 

Definition 1.3.12 Let E be an equivalence relation on a set A. For all x £ A, 
let [x] denote the set 

[x\ = {y € A | yEx}. 

The set [x] is called the equivalence class (with respect to E) determined 
by x. 

In the following theorem, we prove some basic properties of equivalence 
classes. 

Theorem 1.3.13 Let E be an equivalence relation on the set A. Then 

(i) for all x £ A, [x] ^ (f>, 

(ii) if y £ [x], then [x] = [y], where x,y £ A, 

(in) for all x,y £ A, either [x] = [; y\ or [x] D [y] = <f>, 

(iv) A — U xGj 4 [x], i.e., A is the union of all equivalence classes with respect 
to E. 


Proof. (i) Let x £ A. Since E is reflexive, xEx. Hence, x £ [x] and so 
[x] ^ <f>. 

(ii) Let y £ [x]. Then yEx and by the symmetric property of E , xEy. In 
order to show that [x] = [y], we will show that [x] C [y\ and [y] C [x]. The 
result then will follow by Theorem 1.1.1. Let u £ [y]. Then uEy. Since uEy and 
yEx , the transitivity of E implies that uEx. Hence, u £ [x]. Thus, [y] C [x]. 
Now let u £ [x]. Then uEx. Since uEx and xEy, uEy by transitivity and so 
u £ [y]. Hence, [x] C [y]. Consequently, [x] = [y]. 

(iii) Let x,y £ A. Suppose [x] fl [y] ^ fi. Then there exists u £ [x] fl [y]. 
Thus, u £ [x] and u £ [y], i.e., uEx and uEy. Since E is symmetric and uEy , 
we have yEu. Now yEu and uEx and so by the transitivity of E, yEx. This 

* implies that y £ [x]. Hence, by (ii), [y] = [x]. 

(iv) Let x £ A. Then x £ [x] C \J x ^[x\. Thus, A C U x ga[ x }- Also, 

U X £a[x] C A. Hence, A = ■ 


One of the main objectives of this section is to study the relationship be¬ 
tween an equivalence relation and a partition of a set. We now focus our 
attention to partitions. We begin with the following definition. 

Definition 1.3.14 Let A be a set and V be a collection of nonempty subsets of 
A. Then V is called a partition of A if the following properties are satisfied: 

(i) for all B, C £V, either B = C or B D C = (f>. 

(ii} A = U b&vB. 
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In other words, if V is a partition of A , then (i) B C A for all B € V, i.e., 
every element of V is a subset of A, (ii) distinct elements of V are either equal 
or disjoint, and (iii) the union of the members of V is A. 

Example 1.3.15 (i) Let A = {1, 2,3,4, 5,6}. Let A\ — {1}, A<i — {2,4,6}, 
and A?, = {3,5}. Now A = A\ U As U A 3 , Ai n A 2 = 0, A\ D A 3 — 0, and 
A 2 n A 3 = 0. Hence, V = {A\, A 2 , A 3 } is a partition of A. 

fa) Consider Z. Let A be the set of all even integers and B be the set of all 
odd integers. Then A D B = 0 and AUB = Z. Thus, {A, B} is a partition of 
Z. 


The following theorem is immediate from Theorem 1.3.13. 

Theorem 1.3.16 Let E be an equivalence relation on the set A. Then 

V = {[x] | x G A} 

is a partition of A. ■ 

Example 1.3.17 Consider the equivalence relation = n on Z as defined in Ex¬ 
ample 1.3.11. Let 7i n — {[x] | x G Z}. By Theorem. 1.3.16, Z n is a partition of 
Z. Suppose n = 6. We claim, that 

Z 6 = {(0],[1],[2],[3].|4],[5]} 

and 


[i] = {0 + i, ±6 + i , ±12 + i,.. .} = {6<? + i | q G Z} for all i 6 Z. 

Let 0 < n < m < 6. Suppose [n] = [m\. Then m G [n] and so 6|(m — n ). 
This is a contradiction since 0 < m — n < 6. Hence, the equivalence classes 
[0], [1], [2], [3], [4], [5] are distinct. We now show that these are the only distinct 
equivalence classes. 

Let k be any integer. By the division algorithm, k = 6 q+r for som,e integers 
q a,nd r such that 0 < r < 6. Thus, k — r — 6 q and, so 6|(A: —r). This implies that 
k =6 r and so [fc] — [r]. Since 0 < r < 6 we have [r] G {[0], [1], [2], [3], [4], [5]} 
a,nd so [&;] G {[0], [1], [2], [3], [4], [5]}. This proves our first claim. 

Let i G Z. Then x G [i] if and only if 6|(x — i) if and only if 6q = x — i for 
some q G Z if and only if x = 6q + i for some q G Z. This proves our second 
claim.. It now follows that for all i = 0,1,..., 5, [i] = [6 q + i\ for all q G Z. 
Hence, 

for i = 0, [0] = [6] = (12] = • ■ ■ = [-6] = [-12] = • • •; 

/or i = 1, [1] = [7] = [13] = ■ ■ ■ = [-5] = [-11] = • • ■; 

fori = 2, [2] = [8] = [14] = - - - = [-4] = [-10] = •••; 

for i = 3, [3] = [9] = [15] = • ■ • = [-3] = [-9] = • ■ ■; 

for i = 4, [4] = [10] = [16] = • - • = [-2] = [-8] = • • •; 
for i = 5, [5] = [11] = [17] = ••• = [-1] = [-7] = •••■ 
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By Theorem 1.3.16, given an equivalence relation E on a set A, the set of 
all equivalence classes forms a partition of A. We now prove that corresponding 
to any partition, we can associate an equivalence relation. 

Theorem 1.3.18 Let V be a partition of the set A. Define a relation E on A 
by for all x,y £ A, xEy if there exists B £ V such that x, y £ B. Then E is an 
equivalence relation on A and, the equivalence classes are precisely the elements 
ofV. 

Proof. Note that if two elements x and y of A are related, i.e., xEy , then 
x and y must belong to the same member of V. Also, if B £ V> then any two 
elements of B are related, i.e., xEy for all x,y £ B. We now prove the result. 

Since V is a partition of A, A = U bgvB. First we show that E is reflexive. 
Let x be any element of A. Then there exists B £ V such that x £ B. 
Since x,x £ B, we have xEx. Hence, E is reflexive. We now show that E is 
symmetric. Let xEy. Then x,y £ B for some B £ V. Thus, y, x £ B and 
so yEx. Hence, E is symmetric. We now establish the transitivity of E. Let 
£ A. Suppose xEy and yEz. Then x, y £ B and y,z £ C for some B, 
C £ V. Since ?/ € BdC, Bf)C (f. Also, since V is a partition and BC\C <f, 
we have B — C so that x, z £ B. Hence, xEz. This shows that E is transitive. 
Consequently, E is an equivalence relation. 

We now show that the equivalence classes determined by E are precisely 
the elements of V. Let x £ A. Consider the equivalence class [x]. Since A = 
UbgpB, there exists B £ V such that x £ B. We claim that [x] = B. Let 
u £ [x]. Then uEx and so u £ B since x £ B. Thus, [x] C B. Also, since 
x £ B, we have yEx for all y £ B and so y £ [x] for all y £ B. This implies 
that B C [x]. Hence, [x] = B. Finally, note that if C £ V, then C = [n] for all 
u £ C. Thus, the equivalence classes are precisely the elements of V. H 

The relation E in Theorem 1.3.18 is called the equivalence relation on 
A induced by the partition V. 


New relations can be constructed from existing relations. For example, 
given relations R and S from a set A into a set B , we can form relations R fl S', 
R\JS , R\S , (A x B)\R in a natural way. In all these relations, the domain and 
range of the relations under consideration are subsets of A and B , respectively. 
Now given a relation R from a set A into a set B and a relation S from B 
into a set C , there is a relation from A into C that arises in a natural way as 
follows: Let us denote the new relation by T. Suppose (a, b) £ R and (6, c) G S. 
Then we make ( a,c ) £ T. Every element of T is constructed in this way. That 
is, (a, c) £ T for some a £ A and c £ C if and only if there exists b £ B such 
that (a, b) £ R and (b, c ) € S. This relation T is called the composition of R 


and S and is denoted by S o R. Note that to form the composition of R and S , 
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we must have the domain of S and the range of R to be subsets of the same 
set. More formally we have the following definition. 

Definition 1.3.19 Let R be a relation from a set A into a set B and S be a 
relation from B into a set C. The composition of R and S } denoted by S o R, 
is the relation from. A into C defined by 

x(S o R)y if there exists z € B such that xRz and zSy 

for all x E A,y E C. 

Let R be a relation on a set A. Recursively, we define a relation R n , n E N, 
as follows: 

R l = R 

R n = R o R n ~ 1 if n > 1. 

Definition 1.3.20 Let R be a relation from, a, set A into a set B. The inverse 
of R, denoted by R _1 , is the relation from B into A defined by 

xR~~ l y if yRx 

for all x E B, y E A. 

The following theorem gives a necessary and sufficient condition for a binary 
relation to be an equivalence relation. 

Theorem 1.3.21 Let R be a relation on a set A. Then R is an equivalence 
relation on A if and only if 

(i) A C R, where A = {(x,x) | x E A}, 

(ii) R = R -1 , and 
(in) R o R C R. 


Proof. Suppose R is an equivalence relation. Let (x,x) E A, where x E A. 
Since R is reflexive, (x,x) E R. Hence, AC R, i.e., (i) holds. Let ( x,y ) 6 R. 
Since R is symmetric, (y,x) E R. Thus, by the definition of R -1 , (x,y) E R -1 . 
Hence, R C R -1 . On the other hand, let ( x,y ) E R _1 . Then (y,x) E R. 
Therefore, by the symmetric property, (x,y) E R. Hence, R -1 C R. Thus, 
R = R _1 , i.e., (ii) holds. We now prove (iii). Let (x,y) E R o R. Then 
there exists 2 E A such that (x, z) E R and ( 2 , j/) E R. Since R is transitive, 
(x,y) E R. Thus, RoRCR, i.e., (iii) holds. 

Conversely, suppose that (i), (ii), and (iii) hold for R. For all x E A, (x,x) E 
A C R. Thus, R is reflexive. Next, we show that R is symmetric. Let (x, y) E 
R. Then by (ii), (x,y) E R _1 . This implies that (?/,x) E R. Hence, R is 
symmetric. For the transitivity of R, let (x,z) E R and (z,y) E R. Then 
(x,y) ERoR by the definition of composition of relations. Since RoRCR, 


(x, y) E R. Hence, R is transitive. Consequently, R is an equivalence relation. 
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1.3.1 Worked-Out Exercises 

<0> Exercise 1 In Zio, which of the following equivalence classes are equal: [2], 
[-5], [5], [-8], [12], [15], [-3], [7], [22]? 

Solution: We note that [2] = [2 + 10] — [12], [—8] = [—8 + 10] = [2], [12] = 
[12 + 10] = [22], [-5] = [-5 + 10] = [5] = [5 + 10] = [15] and [-3] = [—3 + 10] = 
[7]. Also, [2] 7 ^ [5], [2] ^ [7] and [5] ^ [7]. Hence, it now follows that [2] = [ 12 ] = 
[-8] = [22], [-5] - [5] - [15] and [-3] = [7]. 

Exercise 2 Let R be a reflexive and transitive relation on a set S. Prove that 
R fl R~ l is an equivalence relation. 

Solution: Since (x, x) E R for all x £ S, (x, x) E R 1 for all x E S. 
Thus, (x,x) E Rr\R _1 for all x 6 S. Hence, RnR~ l is reflexive. Let (x,y) £ 
ROR- 1 . Then (x,y) 6 R and (x,y) £ R _1 . Thus, (y,x) £ R~ l and (y, x) £ R. 
Therefore, (y, x) £ R(~] R~ 1 . Hence, Rf 1 R~ l is symmetric. Now suppose that 
(x, y), (y, z) £ R n R~ l . Then (x, y), (y, z) £ R and (x, y), (y, z) £ R~ x . Since 
R is transitive, (x,z) £ R. Now since (x,y),(y,z) £ R~ l , (y,x),(z,y) £ R. 
Since R is transitive, (z, x) £ R and so (x,z) £ i? -1 . Thus, (x,z) £ R D i? -1 . 
Hence, R D i? -1 is transitive. We have thus proved that R fl R~ l is reflexive, 
symmetric, and transitive and hence R fl R~ l is an equivalence relation. 

0 Exercise 3 Give an example of an equivalence relation on the set S = {1, 
2, 3, 4,5,6,7,8} such that R has exactly four equivalence classes. 

Solution: i?={(l,l), (2,2), (3,3), (4,4), (5,5), (6,6), (7,7), (8,8), (1,2), 
(2,1), (3,4), (4,3), (5,6), (6,5), (7,8), (8,7)}. The equivalence classes are [1] = 
[2], [3] = [4], [5] = [6], and [7] = [8], 

Exercise 4 Let R\ and R 2 be two symmetric relations on a set S. Prove that 
Ri o R 2 is symmetric if and only if R\ o R 2 — R 2 ° R \. 


Solution: Suppose R\ o R 2 is symmetric. Let (x, y) be any element of 
R\ o R 2 . Then (y,x) £ R\ o R 2 since R\ o R 2 is symmetric. Thus, there exists 
z £ S such that (y, z) £ R 2 and (z, x) £ R\ by the definition of composition of 
relations. Since R\ and R 2 are symmetric, (z, y) £ R 2 and (x,z) £ R\. Hence, 
(x, y) £ R 2 o Ri. Thus, R\oR 2 C R 2 o Ri. Similarly, R 2 o Ri C R± o R 2 . Hence, 

R\ 0 R2 — R2 0 R\ ■ 

Conversely, suppose that R\ o R 2 = R 2 o R Y . Let (x,y) £ R\ o R 2 . Then 
(x, y) £ R 2 oR i. Thus, there exists z £ S such that (x, z) £ R\ and (z, y) £ R 2 . 
Since R\ and R 2 are symmetric, (z, x) £ Ri and (y,z) £ R 2 . Hence, (y, x) £ 
R 2 o Ri = Ri o R 2 . Thus, R\ o R 2 is symmetric. 
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<> Exercise 5 Let A = {1,2,3,4,5} and R = {(1,1), (2,2), (3,3), (4,4), (5,5), 
(1,2), (2,1), (4,5), (5,4)}. Show that R is an equivalence relation. 

Solution: Let B = {1,2}, C = {3}, and D — {4,5}. Let V = {B,C,D}. 
Then V is a partition of A. Also, note that if x,y € A , then (x,y) E R if and 
only if x,y G X for some X E P, i.e., the relation R is induced by the partition 
V. Hence, R is an equivalence relation on A by Theorem 1.3.18. 

0 Exercise 6 Let X = {1,2, 3,4, 5,6, 7}. Then 

V = {{1,3,5}, {2,6}, {4,7}} 

is a partition of X. List the elements of the corresponding equivalence 
relation R on X induced by V. 

Solution: R = {(a, b) E X x X | a and b both belong to the same element 
of IP}. Then R = {(1,1), (2,2), (3,3), (4,4), (5,5), (6,6), (7,7), (1,3), (3,1), 
(1,5), (5,1), (3,5), (5,3), (2,6), (6,2), (4,7), (7,4)}. 

Exercise 7 Let R be a relation on a set S. Prove that the following conditions 
are equivalent. 

(i) R is an equivalence relation on S. 

(ii) R is reflexive and for all a, 6 , c E -S', if aRb and bRc, then cRa. 

Solution: (i)=>(ii): Suppose R is an equivalence relation on S. Then R is 
reflexive. Let a, 6 , c E S. Suppose aRb and bRc. The transitive property of R 
implies that aRc. Hence, cRa since R is symmetric. 

(ii)=>(i): Since R is given to be reflexive, to show that R is an equivalence 
relation, we only need to check that R is symmetric and transitive. For sym¬ 
metry, suppose aRb. Since R is reflexive, we have aRa. Now since we have aRa 
and aRb, bRa by hypothesis. This shows that R is symmetric. To show that 
R is transitive, suppose aRb and bRc. Then by the hypothesis, cRa. Since we 
have shown that R is symmetric, cRa implies that aRc. Hence, R is transitive. 
Consequently, R is an equivalence relation on S. 


1.3.2 

1. 


Exercises 

Let R be a relation on the set A — {1,2, 3,4,5,6, 7} defined by R — 

{(a, b) E A x A | 4 divides a — 6}. 

(i) List the elements of R. 

(ii) Find the domain of R. 

(iii) Find the range of R. 
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(iv) Find the elements of R~ l . 

(v) Find the domain of R' 1 . 

(vi) Find the range of R _1 . 

2. Let R be a relation on the set A = {1,2,3,4,5, 6} defined by R = {(a, 6) € 
A x A | a b 9}. 

(i) List the elements of R. 

(ii) Is A C R, where A = {(x, x) \ x E A}? 

(iii) Is R = R~ l l 

(iv) Is R o R C R7 

3. Which of the following relations E are equivalence relations on the set of 
integers Z? 

(i) xEy if and only if x — y is an even integer. 

(ii) xEy if and only if x — y is an odd integer. 

(iii) xEy if and only if x < y. 

(iv) xEy if and only if x divides y. 

(v) xEy if and only if x 2 = y 2 . 

(vi) xEy if and only if |a;| — \ y\. 

(vii) xEy if and only if \x — y\ <2. 

4. Let R = {(a, b) \ a, b £ Q and a — b E Z}. Prove that R is an equivalence 
relation on Q. 

5. Let A = {1,2,3,4, 5,6, 7,8}. Define a relation R on A by 

aRb if and only if 3 divides a — b 

for all a,ii G A Show that R is an equivalence relation on A. Find the 
equivalence classes [1], [2], [3], and [4]. 

6. Let R be an equivalence relation on a set A. Find the domain and range 
of R. 

7. Find all equivalence relations on the set S = {a,6,c}. 

8. In Zg, which of the following equivalence classes are equal: [—1], [2], [8], 
[5], [-2], [11], [23]? 

9. Let x, y G Z be such that x = n y , where n E N. Show that for all z E Z, 

(i) x + z = n y + z, 

(ii) xz = n yz 
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10. Let x,y : z,w G Z and n be a positive integer. Suppose that x = n y and 
z = n w. Show that x + z = n y + w and xz = n yw. 

11. Let n be a positive integer and [cc], [y] G Z n . Show that the following 
conditions are equivalent. 

(i) [x] = [y\. 

(ii) x — y — nr for some integer r. 

(iii) n\(x-y). 

12. (Chinese Remainder Theorem) Let m and n be positive integers such 
that gcd(m, n) = 1. Prove that for any integers a and 6, the congruences 
x =m a and x = n b have a common solution in Z. Furthermore, if u and 
v are two solutions of these congruences, prove that u = mn v - 

13. Define relations R \, R 2 , R 3 such that Ri is reflexive and symmetric but 
not transitive, R 2 is reflexive and transitive but not symmetric, and R 3 
is symmetric and transitive but not reflexive. 

14. Prove that the intersection of two equivalence relations on a set S is an 
equivalence relation on S. 

15. Let jR be a relation on a set A. Define T (R) = U {(x, x) \ x G A}. 

Show that T(R) is reflexive and symmetric. 

16. Let R be a relation on a set S. Set R°° = R U R 2 U R? U • ■ •. Prove the 
following: 

(i) R°° is a transitive relation on S. 

(ii) If T is a transitive relation on A such that R C T, then R°° C T. 

(. R°° is called the transitive closure of R.) 

17. Let Ri and R 2 be symmetric relations on a set S such that R\ o R 2 C 
i?2 0 R\ • Prove that R 2 o Ri is symmetric and Ri o R 2 = R 2 o R\. 

18. Let R\ and R 2 be equivalence relations on a set S such that R\ o R 2 = 
R 2 ° R \. Prove that Ri o is an equivalence relation. 


19. Let R\ and be relations on a set S. Determine whether each statement 
is true or false. If the statement is false, give a counterexample. 

(i) If Ri and i? 2 are reflexive, then R\o R 2 is reflexive. 

(ii) If R\ and i? 2 are transitive, then R\o R 2 is transitive. 

(iii) If R\ and i? 2 are symmetric, then Rio R 2 is symmetric. 

(iv) If Ri is transitive, then 1S transitive. 

(v) If Ri is reflexive and transitive, then Ri o R 1 is transitive. 
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1.4 Partially Ordered Sets 

In the previous section, we defined binary relations and studied their basic 
properties. More specifically, we looked at equivalence relations and showed 
that equivalence relations and partitions are closely related. In this section, we 
will consider binary relations which are reflexive, are transitive, and satisfy a 
new property, called antisymmetric. We begin with the following definition. 

Definition 1.4.1 A relation R on a set S is called a partial order on S if it 
satisfies the following conditions: 

(i) (a,a) G R for all a G S (i.e., R is reflexive). 

(ii) For all a, b G S if (a,b) G R and (b,a) G R , then a = b (i.e., R is 

antisymmetric). 

(Hi) For all a,6,c G S, if (a,b) G R and (6, c) G R, then (a, c) G R (i.e., R 
is transitive). 

In other words, a reflexive, antisymmetric, and transitive relation on a set 
S is called a partial order on S. 

Example 1.4.2 Let R be the relation on Z defined by R = {(a, 6) G Z x Z | 
a — b < 0}. We show that R is a partial order on Z. 

First note that a — a = 0 < 0 for all a G Z. Thus, (a, a) G R for all a G Z 
and so R is reflexive. For antisymmetry, let (a, 6), (6, a) G R. Then a — 6 < 0, 
i.e., a < b and b — a < 0, i.e., b < a. This implies that a — b. Thus, R is 
antisymmetric. Finally, we show that R is transitive. Let ( a,6),(6, c) G R. 
Then a — b < 0 and, b — c < 0. Thus, a < b and, b < c. This implies that a < c 
and so a — c < 0. Hence, (a, c) G R. Thus, R is transitive. Consequently, R is 
a partial order on Z. 


Example 1.4.3 Let R be the relation on N defined by R = {(a, b) G N x N | 
a divides b in N}. Then R is a partial order on N. 

As in the previous example, we show that R is reflexive, antisymmetric, 
and transitive. 

Reflexive: Let a G N. Since a = la, we have a\a and so (a, a) G R. Thus, 
R is reflexive. 

Antisymmetric: Let (a,b), (b,a) G R. Then a\b and, 6|a. Thus, b = ad and 
a = be for som,e positive integers c and d. Therefore, a — be — adc and so 
1 = cd. Since c and d are positive integers and cd = 1, it follows that c = d = 1. 


Hence, a = b. Thus, R is antisymmetric. 

Transitive: Let (a, b), (b, c) G R. Then a\b and b\c in N. Thus, b = an and 
c = bm for som,e positive integers m, and n. This implies that c — bm. = anm, 
and since m, and n are positive integers, nm, is a positive integer. Thus, a|c in 
N and so (a, c) G R. Hence, R is transitive. 

Consequently, R is a partial order on N. 
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Example 1.4.4 Consider the relation R = {(a, 6 ) E Z x Z | a divides b in 
Z} on Z. As in the previous example, we can show that R is reflexive a.nd 
transitive. Since 6 = (—1)(— 6 ) and —6 = (—1)6, ( 6 , — 6 ) G R and (— 6 , 6 ) G R, 
but 6 7 ^ — 6 . Thus, R is not antisymmetric, proving that R is not a partial order 
on Z. 


Example 1.4.5 Let S be a set and V{S) the power set of S. Let R be a relation 
on 'P(S) given by R — {{A,B) G V(S)xV(S) \ A C B }. We show that R is a 
partial order on V(S). Since A C A for all A G P(S), we find that {A, A) G 
R for all A G V{S). This shows that R is reflexive. For antisymmetry, let 
(A, B ), (B, A) G R. Then by the definition of R, A C B and B C A and so A = 
B. Thus, R is antisymmetric. To show that R is transitive, let ( A , B), (B, C) G 

R. Then A C B and B C C and so A C C. Thus, (A, C) G R. Hence, R is 

transitive. Consequently, R is a partial order on V(S). 

A partial order on a set S is usually denoted by < . Instead of writing 
(a, b) G <, from now on we shall write a < b. 

Definition 1.4.6 A set S together with a partial order is called a partially 
ordered set (poset). 

If S is a partially ordered set with partial order <, then we write ( S , <). 

In Example 1.4.2, R is a partial order. This relation is the usual “less 

than or equal to” relation on Z. In Example 1.4.3, R is a partial order. We 

call this relation the divisibility relation on N. Hence, N together with the 
divisibility relation is a poset. From Example 1.4.4, we find that Z together 
with the divisibility relation is not a poset. The partial order in Example 1.4.5 
is known as set inclusion relation. V(S) together with set inclusion relation 
is a poset. 

Let S be a poset and a, b G S. If either a < b or b < a, then we say that a 
and b are comparable. 

Definition 1.4.7 A partially ordered set ( S , <) is called a linearly ordered 
set or a chain if for all x,y G S either x < y or y < x. 


Thus, a linearly ordered set or a chain is a poset in which any two elements 
are comparable. 


Example 1.4.8 (i) Z together with the usual “less than, or equal to” (Example 
1A.2) relation is a chain. 

(ii) N with the divisibility relation (Example 1.4.3,) is not a chain because 
neither 3 divides 5 nor 5 divides 3, i.e., 3 and 5 are not comparable. 

(Hi) Let S be a set with more than one element. Then V(S) together with 
the set inclusion relation (Example 1.4.5,) is not a chain since if a and b are 
distinct elements of S, then neither {a} is a subset of {b} nor { 6 } is a subset 
of {a}, i.e., {a} and {6} are not comparable. 
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Definition 1.4.9 Let ( S , <) be a poset and {a, b] be a subset of S. An elem.ent 
c €E S is called an upper bound of {a, b} if a < c and b < c. 

An elem.ent d G S is called a least upper bound (lub) of {a, 6} if 

(i) d is an upper bound of {a, b} and 

(ii) if c € S is an upper bound of {a, 6 }, then d < c. 

Example 1.4.10 (i) Consider the set N together with the divisibility relation 
(Example 1.4.3,). For all a, b N, a < b if and, only if a divides b. Now for the 
subset {4,6}, 12,24,36 are all upper bounds o/{4,6}. However, 12 is the least 
upper bound of {4,6}. 

(ii) Consider the set Z together with the usual u less than or equal to” 
relation (Example 1.4.2,). For the subset {4,6}, 6,7,8, ... are all upper bounds 
o/{4, 6 }. However, 6 is the least upper bound o/{4,6}. 

(in) Let S = {1,2, 3, 4}. Let < denote the set inclusion rela.ti.on (Example 
1.4.5,). Then (V(S),<) is a poset. Let A = {1,2} and B = {1,4}. Then 
j4UB = {1,2,4} is the least upper bound of { A , B }. 

Remark 1.4.11 (i) In a poset (S, <), a subset {a, 6 } of S may not have an 
upper bound. 

(it) In a poset (S, <), a subset {a, 6 } of S m,a,y have m.ore than one upper 
bound. 

(in) In a poset (S, <), a subset {a, 6 } of S m,a,y not have a lub. 

(iv) In a poset (S, <), if a subset {a, 6 } of S has a lub, then this lub is 
unique. 

We leave the verification of (i), (ii), and (iii) as an exercise and verify (iv). 
Let c, d 6 S be two lubs of {a,b}. Then c and d are upper bounds of {a, 6 }. 
Since c is a lub of {o, 6 } and d is an upper bound of {a, 6 }, c < d. Similarly, 
d < c. Hence, c — d. 

Notation: The lub of {a, 6 } in (S', <), if it exists, is denoted by a V b. 

Definition 1.4.12 Let (S, <) be a poset and {a, b} be a subset of S. An elem.ent 
c E S is called a lower bound of {a, 6 } if c < a and c < b. An element d £ S 
is called a greatest lower bound (gib) of {a, 6 } if 

(i) d is a lower bound of {a, 6 } and 

(ii) if c E S is a lower bound of {a, 6 }, then c < d. 

Remark 1.4.13 (i) In a poset (S, <), a subset {a, 6 } of S m,ay not have a 
lower bound. 

(n) In a poset (S, <)., a subset {a , 6 } of S m,ay have m.ore than one lower 
bound. 

(iii) In a poset (S', <), a subset {a, 6 } of S m.a.y not have a gib. 

(iv) In a poset (S, <), if a, subset {a, 6 } of S has a gib, then this gib is 
unique. 
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Notation: The gib of {a, 6} in (S, <), if it exists, is denoted by a A b. 

A useful device in the study of posets is the poset diagram. Let (5, <) be 
a poset and x,y £ S. We say that y covers x, denoted by y y x, if x < y, 
x ^ y, and there are no elements z € S such that x < z < y, x ^ z, z ^ y. We 
represent the elements of S by the elements themselves in the plane such that 
if x < y, then y occurs above x, and we connect x with y by a line segment 
if and only if y covers x. The resulting diagram is called the poset diagram of 

(s,<). 

Example 1.4.14 Let S = {1,2,3}. Then 

V(S) = {0, { 1 }, { 2 }, {3}, { 1 , 2 }, { 2 ,3}, {1,3}, S}. 

Now (V(S), <) is a, poset, where < denotes the set inclusion relation. The 
poset diagram, of (fP(S), <) is given below. 




Definition 1.4.15 Let (5, <) be a poset. An element u £ S is called a max¬ 
imal (minimal ) element of S if there is no element v £ S such that u < v 
(v < u ) and u ^ v. 

Example 1.4.16 Let S = {1,2,3} and T be the set of all proper nonempty 
subsets of S. Now (T, <) is a poset, where < is the set inclusion relation. In 
this poset { 1 }, {2}, a,nd, {3} are minimal elements and {1,2}, {1, 3}, {2, 3} are 
m,a,xim,al elements. 

Next, we state the following fundamental axiom of set theory. There are 
several places in this text, where we will use it very effectively. 

Zorn’s Lemma: If every chain in a poset (S', <) has an upper bound in S, 
then S contains a maximal element. 

We have seen several examples of posets in which lub (gib) need not exist. 
Next, we study those posets for which lub (gib) exists. 
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Definition 1.4.17 A poset (L, <) is called a, lattice if a A 6 and a\/b exist in 
L for all a,b € L. 

Example 1.4.18 Let L — [0,1] — {x € R | 0 < x < 1 }. Then (L, <) is a poset, 
where < denotes the usual “less than or equal to” relation. Let a,b G [0,1]. Now 
max{a, b} 6 L and, min{a, b } £ L. It is easy to see that max{a, 6 } is the lub of 
{a, 5} and min{a, 6 } is the gib of {a,b}. For example, max{.2, .3} = .3 = .2V.3 
and min{.2, .3} = .2 = .2 A .3. Hence, (L, <) is a lattice. 

Example 1.4.19 Let S be a set. Then ( V(S ),<) is a poset, where < is the 
set inclusion relation. For A,B 6 V(S), we can show that i V B — A U B and 
A A B = A fl B. Hence, (fP(S), <) is a lattice. 

In the following theorem, we collect several useful properties of a lattice. 

Theorem 1.4.20 Let (L, <) be a lattice and a,b,c£ L. Then 
(LI) aV b = bV a, a A b = b A a (commutative laws), 

(L2) a V (6 V c) = (a V b) V c, a A (6 A c) = (a A b) A c (associative laws), 
(L3) a V a = a, a f\a — a (idem,potent laws), 

(Lf) a V (a A b) = a, a A (a V 6) = a (absorption laws). 

Proof. (Ll) a V6 = lub of {a, 6 } = lub of {6, a} — by a. Note that the proof 
follows from the fact that the set {a, b} is the same as the set {b, a}. 

We leave the remainder of the proof to the exercises except for L4. 

(L4) Now a < a and a/\b < a. Hence, a is an upper bound of {a, aA6}. Thus, 
by the definition of least upper bound, a V (a A b) < a. Since a V (a A b) is the 
lub of {a, aA&}, a < aV(aA&). Hence, a = aV(aA&) since < is antisymmetric. ■ 

The proof of the following result is left as an exercise. 

Theorem 1.4.21 Let (S, <) be a poset and a,b E S. Then the following con¬ 
ditions are equivalent. 

(i) a < b. 

(ii) a V 6 = b. 

(Hi) a A b = a. 

Definition 1.4.22 A lattice (L, <) is called a modular lattice if for all a, b, c G 
L, a < c implies 

aV(bAc) = (a V b) A c. 

The lattices defined in Examples 1.4.18 and 1.4.19 are modular lattices. 

“Pufuc. 
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Example 1.4.23 Consider the lattice given by the following diagram 



Here a < c, but aV(&Ac) = aVO = a^c = (a V 5) A c. Hence, this lattice 
is not m,odula.r. 

Definition 1.4.24 A lattice (L, <) is called distributive if it satisfies 
(D1) a A [b V c) = (a A b) V (a A c) 
for all a, b, c € L. 

The lattices defined in Examples 1,4.18 and 1.4.19 are distributive lattices. 

Theorem 1.4.25 A lattice (L, <) is distributive if and only if 
(D2) a V (6 A c) = (a V b) A (a V c) 

/or all a,b,c E L. 

Proof. Suppose (L, <) is distributive. Let a, 6, c E L. Then 

(a V 6) A (a V c) = ((a V b) A a) V ((a V 6) A c) by D1 

= (a A (a V &)) V ((a V 6) A c) by LI 


= a V ((a V b) A c) by L4 

= a V (c A (a V b)) by LI 

= a V ((c A a) V (c A 6)) by D1 

= (a V (c A a)) V (c A b) by L2 

= (a V (c A a)) V (b A c) by LI 

= a V (6 A c) by L4. 


Hence, a V (6 A c) = (a V 6) A (a V c). Similarly, D2=>D1. ■ 
Theorem 1.4.26 Every distributive lattice is a. modular lattice. 


Proof. Let ( L , <) be a distributive lattice and a,b,c £ L be such that a < c. 
Then aV(6Ac) = (aV6)A(aVc) = (aV6)Ac. Hence, (L, <) is a modular lattice. ■ 


Theorem 1.4.26 says that every distributive lattice is a modular lattice. 
However, the converse of this result is not true, as shown by the following 
example. 
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Example 1.4.27 Consider the lattice given by the following poset 


a 



This is a modular lattice, but not a distributive lattice since a V (b A c) = 
a V 0 = a 7^1= (a V 6) A ( a V c). 

Theorem 1.4.28 In a distributive lattice (L, <), 

a A b = a A c and a V b — a V c imply that b — c 



for all a, b, c 6 L. 

Proof. Now b = bA( aWb) = 6 A ( a Vc) = (b Aa) V(6Ac) = (a Ac) V (6Ac) = 
(c A a) V (c A b) — c A (a V 6) = c A (a V c) = c. ■ 

1.4.1 Worked-Out Exercises 

0 Exercise 1 Suppose that in a poset (P, <), a A 6, 6 Ac, and a A (6 Ac) exist, 
where a, 6, c € P. Show that (a A b) A c exists and a A (6 A c) = (a A 6) A c. 

Solution: Now a A (6 A c) < a, a A (b A c) < b A c, b A c < 6 , and 6 A c 
< c. Hence, a A (6 A c) is a lower bound of a, 6 . Since a A 6 exists, we find that 
a A (b A c) < a A b. Also, a A (6 A c) < c. Hence, a A (6 A c) is a lower bound of 
{a A6, c}. Let d be a lower bound of {a A b, c}. Then d < aAb and d < c. Thus, 
d < a, d < 6, and d < c. Since 6 A c exists, d < b A c. Also, a A (6 A c) exists. 
Hence, d < a A (b Ac). Thus, a A (6 A c) is the gib of {a A b, c}. Consequently, 
(a Ab) Ac exists and a A (b A c) = (a Ab) Ac. 

Exercise 2 Show that every chain is a distributive lattice. 

Solution: Let (L, <) be a chain and a, b, c G L. Since L is a chain, either 
a < b or b < a. If a < 6, then a V 6 = b and a A b = a. If 6 < a, then a V 6 = a 
and a A b = b. Hence, for any two elements a,b e L, a A b and a V b exist in L. 
Suppose a < b. 

Case 1: b < c. 

Now a A (6 V c) = a A c = a and (a A b) V (a A c) = a V a = a. Hence, we 
have a A (b V c) — (a A b) V (a A c). 

Case 2: c <b. 

Subcase 2a: a < c. 
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In this case, we have a < c < b. Now a A ( b V c) = aAb - a and 

(a A b) V (o A c) — a V a = a. Hence, a A ( bVc)= (a A b) V (a A c). 

Subcase 2b: c < a. 

In this case, we have c < a < b. Now a A ( b V c) — a A b — a and 

(a A b) V (a A c) = a V c = a. Hence, a A ( b V c) = (a A 6) V (a A c). 

Similarly, if 6 < a, then a A ( 6 V c) — (a A b) V (a A c). 

Exercise 3 In a lattice (L, <), prove that (a A 6) V (a A c) < a A (6 V (a A c)) 
for all a, 6, c E L. 

Solution: a Ab < a, a Ac < a. Hence, (a A b) V (a A c) < a. Again aAb<b 
implies (a A 6) V (a A c) < bV(aAc). Thus, we find that (a A 6) V (a A c) is a 
lower bound of {a, b V (a A c)}. But a A (b V (a A c)) is the gib of {a, b V (a A c)}. 
Hence, (a A 6) V (a A c) < a A (b V (a A c)). 

Exercise 4 Prove that a lattice (L, <) is modular if and only if (aAb)y(aAc) = 
a A (by (a A c)) for all a, 6, c E L. 

Solution: Suppose (L, <) is modular. Then 

(a A b) V (a A c) = (a A c) V (a A b) 

= (a A c) V (6 A a) 

= ((a A c) V b) A a (by modularity since a A c < a) 

= a A (bW (a Ac)). 

Conversely, suppose that (aAb) V(aAc) — a A (by (a Ac)) for all a, 6, c E L. Let 
a, b, c E L be such that a < c. Then a Ac — a. Now (cAfr)V(cAa) — cA(6V(aAc)). 
Hence, (c A b) V a — c A (b V a), i.e., a V (b A c) — (a V b) A c. 

1.4.2 Exercises 

1. Draw the poset diagram for each of the following posets. 

(i) ({a [ a is a positive divisor of 20} ,<), where < denotes the divisibility 
relation. 

(ii) (N, <), where < denotes the natural order relation. 

(iii) (V(S) , <), S — {1,2,3,4}, where < denotes the set inclusion relation. 

(iv) (P(S)\{<£},<), S — {1, 2 ,3}, where ^ denotes the set inclusion re¬ 
lation. 

2. Give an example of a relation R which is antisymmetric, but not reflexive. 

3. Give an example of a poset (P, <) such that P has two elements a and b 
for which aAb does not exist. 
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4. Show that (R, <) is not a poset, where a < b means that b = ad for some 
d G R. 

5. Let <i and <2 be two partial orders on a set S. Is <1 D <2 a partial 
order on S7 

6 . Let (A, < 1 ) and (B, < 2 ) be two posets. Prove that ( A x B, <) is a poset, 
where (a, 6) < (c, d) if and only if a <1 c and b <2 d. 

7. Let (P, <) be a poset and a, b, c £ P. 

(i) If a V b } b V c, and a V ( b V c) exist, show that (aV 6) V c exists and 
aV(6Vc) = (aV 6) V c. 

(ii) If a V 6 exists, prove that a V (a V 6) exists and aV6 = aV(aV6). 

8 . Which of the following posets are lattices? 

a 

a 


b 



c 


d 

( i ) (ii) (Hi) 

9. Let D(40) denote the set of all positive divisors of 40. Consider the lattice 

(D(40), <), 

where < denotes the divisibility relation. Find 4 A ( 8 V 10) and (2 V (2 A 
8)) V 20. 

10. In a lattice (L, <), prove the following. 

(i) a V (b A c) < (a V b) A (a V c), 

(ii) (a A b) V (a A c) < a A (6 V c), 

(iii) (a A 6) V (6 A c) V (c A a) < (a V b) A (6 V c) A (c V a), 

(iv) if a < c, then a V (6 A c) < (a V 6) A c, 
for all a, 6, c £ L. 
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11. In a modular lattice (L, <), prove that for all a,b,c £ L, a < c, aAb — cAb, 
and a V b — c V b imply that a — c. 

12. Prove that a lattice (L, <) is distributive if and only if for all a,b,cE L, 

(a A 6) V (b A c) V (c A a) = (a V b) A (6 V c) A (c V a). 

13. Determine whether the following assertions are true or false. If true prove 
the result; and if false give a counterexample. 

(i) The relation R = {(a, b) <E Z x Z | |a — 6 | < 1} is a partial order on 
Z. 

(ii) The relation R = {(a, b) € Z x Z | |a| < ) 6 |} is a partial order on Z. 

(iii) The relation R — {(a, b) G 5x5 | a divides 6 in N} is a partial order 
on S = {1,2,3,4,6,12}. 


1.5 Functions 

Like sets, functions play a central role in mathematics. Readers may already 
be familiar with the notion of a function either through a college algebra or a 
calculus course. In these courses, functions were usually real valued. Throug¬ 
hout the text we will encounter functions which do not have to be real valued. 
Functions help us study the relationship between various algebraic structures. 
In this section, we review some of their basic properties. Roughly speaking, a 
function is a special type of correspondence between elements of one set and 
those of another set. More precisely, a function is a particular set of ordered 
pairs. 

Definition 1.5.1 Let A and B be nonempty sets. A relation f from A into B 
is called a function (or mapping) from, A into B if 

(i) V{f) = A and 

(ii) for all (x,y), (x'^y') € /, x = x' implies y — y'. 

When (ii) is satisfied by a relation /, we say that f is well defined or 
single-valued. 


We use the notation / : A —> B to denote a function / from a set A into a 
set B. For (x, y) € /, we usually write f(x) = y and say that y is the image 
of x under / and x is a preimage of y under f. 

Leibniz seems to be the first to have used the word “function” to stand for 
any quantity related to a curve. Clairant (1734) originated the notation f(x) 
and Euler made extensive use of it. Dirichlet is responsible for the current 
definition of a function. 
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Let us now explain the above definition. Suppose f : A —* B. Then / is 
a subset of A x B such that for all x G A, there exists a unique y G B such 
that (x, y ) G f. Hence, we like to think of a function as a rule which associates 
to each element x of A exactly one element y of B. In order to show that a 
relation / from A into B is a function, we first show that the domain of f is A 
and next we show that f well defined or single-valued, i.e., if x = y in A, then 
f(x) = f(y) in B for all x,y G A. 

We now consider some examples of relations, some of which are functions 
and some of which are not. 


Example 1.5.2 Let f be the subset of Z x Z defined by 

f — {(n, 2n + 3) | n G Z}. 

Then D(f) = {n \ n € Z} = Z. We now show that f is well defined. Let 
n ,m G Z. Suppose n = m. Then 2n + 3 = 2m + 3, i.e., f(n) = f(m). Therefore, 
f is well defined. Hence, f satisfies (i) and (ii) of Definition 1.5.1 and so f is 
a function. 

Example 1.5.3 Let A — {1,2,3,4} and B = {a, 6 , c}. Let f be the subset of 
A x B defined by 

/ = {(l,a),(2,6),(3,c),(4,6)}. 

First note that T>(f) = {1,2,3,4} — A and so f satisfies (i) of Definition 
1.5.1. From the definition of /, it is immediate that for all x G A, there exists 
a unique y G B such that {x,y) G f. Therefore, f is well defined and so f 
satisfies (ii) of Definition 1.5.1. Hence, f is a function. 


Example 1.5.4 Let f be the subset of Q x Z defined by 

f = I Pi<l € Z > <7^0}. 

First we note that D(f) = {^ j p, q € Z, q ^ 0} = Q. Thus, f satisfies (i) of 

Definition 1.5.1. Now (|,2) 6 /, (§,4) G / and § = But /(|) = 2^4 = 
/(|). Thus, f is not well defined. Hence, f is not a. function from Q into Z. 

Example 1.5.5 Let f be the subset ofZxZ defined by 


f — {(mn, m + n) \ m, n € Z}. 


First we show that f satisfies (i) of Definition 1.5.1. Let x be any element of 
Z. Then we can write x = x • 1. Hence, (x, x + 1) = (x • l,x + 1) 6 /. This 
implies that x G D(f). Thus, Z C D(f). However, D(f) C Z and so D(f) = Z. 
Thus, f satisfies (i) of Definition 1.5.1. Now 4 G Z and 4 = 4 • 1 = 2 • 2. Thus, 
(4 • 1,4 + 1) G / and (2 • 2, 2 + 2) G /. Hence, we find that 4 • 1 = 2 • 2 and 
/(4 • 1) = 5 ^ 4 = /(2 - 2). 77ms implies that f is not well defined, i.e., f does 
not satisfy (ii) of Definition 1.5.1. Hen ce, f is not a function from Z into Z. 
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We now explore the meaning of equality of two functions. 

Let f : A B and g : A —*■ B be two functions. Then / and g are subsets of 
AxB. Suppose f — g. Let x be any element of A. Then (x, f{x )) E / = g. Also, 
(x,g(x)) E g. Since g is a function and (x, /(x)), (x, g(x)) E g, we must have 
g{x) = /(x). Conversely, assume that g{x) = fix) for all x E A. Let (x,y) E /. 
Then y = f(x) = g{x). Thus, ( x,y ) E g. This implies that / C g. Similarly, we 
can show that g C /. It now follows that f = g. Thus, two functions / : A —> B 
and g : A —► B are equal if and only if /(x) = g(x) for all x E A. 

Example 1.5.6 Let f : Z —» Z# and g : Z —> Z^ be defined by f — {(n,n 2 ) | 
n E Z} and g — {(n, |n| 2 ) | n E Z}. Now for all n E Z, 

/(n) = n 2 = |n| 2 = p(n). 


Hence, f = g. 

Definition 1.5.7 Let f be a function from, a set A into a set B. Then 

(i) f is called one-one if for all x,x' E A , f{x) — fix') implies x = x'. 

(ii) f is called onto B (or f maps A onto B) if T[f) = B. 

We note that if / : A —> B, then T{f) = B if and only if for all y E B, 
there exists x E A such that f(x) = y. In other words, X{f) = B if and only if 
every element of B has a preimage. We also note that / is one-one if and only 
if every element of B has at most one preimage. 

Let A be a nonempty set. The function '■ A —> A defined by ia(x) — x 
for all x E A is a one-one function of A onto A. i j 4 is called the identity map 
on A. 

Example 1.5.8 Consider the relation f from. 7i into Z defined by 

f{n) = n 2 

for all n E Z. Now T>(f) = Z. Also, if n — n ', then n 2 = (n') 2 , i.e., f{n) = 
f{n'). Hence, f is well defined. Thus, f is a function. Now /(1) = 1 — /(—1) 
and 1 7^ —1. This implies that f is not one-one. Now for all n E Z, /(n) 
is a nonnegative integer. This shows that a negative integer has no preim,age. 
Hence, f is not onto Z. Note that f is onto {0,1,4, 9,...}. 

Example 1.5.9 Consider the relation f from. Z into Z defined by for all n E Z, 
/(n) = 2n. As in the previous examples, we can show that f is a function. Let 
n, 77/ E Z and suppose that f(n) = f(n'). Then 2 n — 2 n', i.e., n = n!. Hence, 
f is a one-one function. Since for all n E Z, f(n) is an even integer, we see 
that an odd integer has no preim.age. Thus, f is not onto Z. However, we note 
th,a,t f is onto E. 
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Definition 1.5.10 Let A, B, a,nd C be nonempty sets and f : A —> B a.nd 
g : B —> C. The composition o of f and g , written g o /, is the relation from. 
A into C defined as follows: 

g o f = {(x, z) | x £ A, z £ C, there exists y £ B 

such that f(x) = y and g(y) = z}. 

Let / : A —*■ B and g : B —> C and (x,z) £ g o /, i.e., (g o f)(x) = z. 
Then by the definition of composition of functions, there exists y £ B such 
that /(x) — y and g(y) — z. Now 

z = 9(y ) = p(/(z)). 


Hence, (g o f)(x) = g(f{x)). 

In the following, we describe some properties of composition of functions. 

Theorem 1.5.11 Suppose that f : A —» B a.nd g : B —» C. Then 

(i) g o f ; A —> C, i.e., g o / is a function from, A into C. 

(it) If f and g are one-one, then g o f is one-one. 

(Hi) If f is onto B and g is onto C , then g o f is onto C. 

Proof. (i) Let x G A. Since / is a function and x 6 A, there exists y € B 
such that f(x) — y. Now since g is a function and y £ B, there exists z € C 
such that g(y) = z. Thus, (g o f)(x) = g{f(x)) = g(y) = z, i.e., (x,z) ego f. 
Hence, x 6 V{g o /). This shows that A C Z>(<? o /). But Z)(p o /) C A and so 
V(g ° f) = A. Next, we show that g o / is well defined. 

Suppose that (x, z) € go /, (xi, zi) £ go f and x = xi, where x, xi £ ^4 and 
z, z\ £ C. By the definition of composition of functions, there exist y,y\ £ B 
such that /(x) = y , p(j/) = z, /(xi) = y\ and g(y\) = zi. Since / is a function 
and x = Xi, we have y = y\. Similarly, since g is a function and y = y\, we 
have z = z\. Thus, g o / is well defined. Hence, g o / is a function from A into 

C. 

(ii) Let x,x' £ A. Suppose (gof)(x) = (^o/)(x'). Then g(f(x)) = g(f(x')). 
Since g is one-one, fix) = fix'). Since / is one-one, x = x' . Thus, g o f is one- 
one. 

(iii) Let z £ C. Then there exists y £ B such that g[y) = z since g is 
onto C. Since / is onto B , there exists x £ A such that f(x) = y. Thus, 
(g 0 f){x) = p(/(x)) = g(y) = z. Hence, go f is onto C. ■ 

Example 1.5.12 Consider the function f : Z —> Z and, g : Z —» E, where 
fin) — n 2 and p(n) = 2n /or all n £ Z. 77?,en g o f : 7i —» E and, (g o f){n) = 
g(f{n)) = g(n 2 ) = 2 n 2 . 
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Theorem 1.5.13 Let f : A —> B, g : B —> C, and h : C —» D. Then 

ho (g o f) = (ho g) o f. 

That is, composition of functions is associative. 

Proof. First note that h o (g o f) : A —> D and (h o g) o f : A —> D. Let 
x e A. Then 

[ho(gof)}(x) = h((gof)(x)) = h(g(f(x))) = (hog)(f(x)) = [(h o g) o f](x). 
Thus, by the equality of two functions, h o (g o /) = (h o g) o f. ■ 

Let A be a set and / : A —> A. Recursively, we define 

f 1 ^) = / 0 ) 

f n+l (x) = (fof n )(x) 


for all x 6 A, n E N. 

Let A and B be sets. A and B are said to be equipollent, written A ~ B, 
if there exists a one-one function from A onto B , i.e., the elements of A and B 

are in one-one correspondence. 

From Theorem 1.5.11, it follows that ~ is an equivalence relation. If A ~ B, 
then sometimes we write \A\ = \B \ . It is immediate that if A and B are finite 
sets, then \A\ = |B| if and only if A and B have the same number of elements. 

The following lemma, which follows from Theorem 1.5.11(ii), is of indepen¬ 
dent interest. We give a direct proof of this result. 

Lemma 1.5.14 Let A be a set and f : A —> A be a one-one function. Then 
f n :A^Aisa one-one function for all integers n > 1 . 


Proof. Suppose there exists n > 1 such that f n is not one-one. Let m > 1 
be the smallest positive integer such that f m is not one-one. Then there ex¬ 
ist x, y E A such that x ^ y and / m (x) = / m (y). But then /(/ m_1 (x)) = 
f(f m ~ 1 (y )) and hence / m- 1 (x) = / m- 1 (y) since / is one-one. Now since m 
is the smallest positive integer such that / m is not one-one, / m_1 is one-one. 
Hence, x = y, which is a contradiction. Thus, f n is one-one for all n > 1. ■ 

That one-one functions on a finite set are onto is proved next. 


Theorem 1.5.15 Let A be a finite set If f : A —> A is one-one, then f is 
onto A. ii. 
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Proof. Let y E A. Now f n (y) E A for all n > 1. Hence, 

{yJ(y)J 2 {y),- • ■} Q A. 

Since A is finite, all elements of the set {y, f(y ), f 2 (y ),...} cannot be distinct. 
Thus, there exist positive integers s and t such that s > t and f 8 (y) = f\y). 
Then *(?/)) = /%). Hence, f s ~\y) = y since by Lemma 1.5.14, /* is 

one-one. Let x = f s ~ t ~ 1 {y) E A. Then /(x) = y. Hence, / is onto A. ■ 

Definition 1.5.16 Let A o,nd B be sets and f : A —*■ B. 

(i) f is called left invertible if there exists g : B —> A such that 

go f = i A . 

(a) f is called right invertible if there exists h : B —► A such that 

foh — i B . 


A function / : A —» B is called invertible if / is both left and right 
invertible. 


Example 1.5.17 Let f : Z —» Z and g : Z —> Z be as defined below. 


9(n) 


for all n E Z. Now 

(/ ° 9 ) O) 


/(n) = 3 n 

^ if n is a multiple of 3 
0 if n is not a multiple of 3 

/OM) 

{ n if n is a multiple of 3 
0 if n is not a multiple of 3. 


Hence, f o g ^ ij,. But (g o f)(n) = g(f(n)) = g(3n ) = n for all n E Z. Thus, 
9 ° f = iz- Hence, g is a left inverse of f. 


Often we are required to find a left (right) inverse of a function. However, 
not every function has a left (right) inverse. Thus, before we attempt to find 
a left (right) inverse of a function, it would be helpful to know if a given 
function has a left (right) inverse or not. The following theorem is very useful 
in determining whether a function is left (right) invertible or invertible. 

Theorem 1.5.18 Let A and B be sets and f : A —► B. Then the following 
assertions hold. 

(i) f is one-one if and only if f is left invertible. 

(n) f is onto B if and only if f is right invertible. 

(Hi) f is one-one and onto B if and only if f is invertible^. 

“Public. TfeafAewtafLta/ 
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Proof. (i) Suppose / is left invertible. Then there exists g : B —> A such 
that gof — i A . Let x, y E A be such that f(x ) = /(y). Then g(f(x)) ~ p(/(y)) 
or (go f)(x) = (po/)(y). Hence, m(x) = i A (y), be., x = p. Thus, / is one-one. 

Conversely, suppose / is one-one. Then for y E B, either y has no preimage 
or there exists a unique x y E A such that f(x y ) = y. Fix x E A. Define 
g : B -> A by 


9(y) = 


x if y has no preimage under / 

x y if y has a preimage under / and f(x y ) — y 


for all y E B. By the definition of p, V(g) = B. To show p is well defined, 
suppose p,p' E B and y — y' . Then either both y and y' have no preimages or 
there exist unique x y ,x y > E A such that f(x y ) = y and f(x y >) = y' . Suppose 
both y and y r have no preimages. Then g(y) — x — g(y'). Now suppose there 
exist unique x y , x y > E A such that f(x y ) =■ y and f(x y >) — y' . Thus, g(y) = x y 
and g(y') = x' y . Since y — p', we have f(x y ) — f(x y >). Since / is one-one, 
x y — x y ' and so g(y) — g(y'). We have thus shown that p is well defined and so 
p is a function. We now show that gof —%a- Let u E A and suppose f(u) = v 
for some v E B. Then by the definition of p, g(v) = u. Thus, 


(9 0 f)(v) = g(f(u )) = g(y) = u = i A (u). 


Hence, gof = i A . 

(ii) Suppose / is right invertible. Then there exists p : B —>■ A such that 
/ o p = i B . Let y E B. Let x = g(y) E A. Now y = i B (y ) = (/ o g)(y) = 
f{g(y)) = f( x )- Hence, / is onto B. 

Conversely, suppose / is onto B. Let y E B. Since / is onto, there exists 
x E A such that f(x) — y. Let A y = {x E A \ f(x) = y}. Then A y 7 ^ <f. Choose 
x y E A y for all y E B. Define h : B —> A such that h(y) — x y for all y E B. 
Then h is a function. Let y E B. Then (/ o h)(y) = f(h(y )) = f(xy) — y — 
i B (y)- Hence, f oh — i B and so / is right invertible. 

(iii) The result here follows from (i) and (ii). ■ 


Let / : A — » B be invertible. Let p be a left inverse of / and h be a right 
inverse of /. Then p o / — i A and f o h = i B - Now g = g o i B — g o (f oh) = 
(g 0 f) o h — i A o h — h. Thus, if / is invertible, then left and right inverses of 
/ are the same. This also proves that the inverse of a function, if it exists, is 
unique. 

If / is an invertible function, then the inverse of / is denoted by f~ 1 . 

Let f : A—> B and A' C A. Then / induces a function from A' into B in a 
natural way as defined next. 

Definition 1.5.19 Let f : A —> B and A! be a nonempty subset of A. The 
restriction of f to A) written f\ A ', is defined to be 

f\ A ' = {(x'J(x')) I x' e A'}. 
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We see that f\^ is really the function / except that we are considering / 
on a smaller domain. 

Definition 1.5.20 Let f : A' —> B and A be a set containing A'. A function 
g : A —> B is called an extension of f to A if g\ a 1 — /• 

Example 1.5.21 Consider the function f : E —> Z and g : Z —* Z, where 
f(2n) = 2n + 1 and g(n) = n*f 1 for all n £ Z. Then g is an extension of f to 
Z a,nd f is the restriction of g to E. Let the function h : Z —> Z 6e defined by 
for all m € Z, h(m) = m + 1 if m £ E and /i(ra) = m if m ^ E. Then h is 
a,n extension of f to Z. However, h ^ g. Thus, a function may have m,ore than 
one extension. 

In Section 1.1, we defined the Cartesian cross product, Ax B, of two sets A 
and B. We now extend this notion to a family of sets {A a \ a £ /}, where / is an 
index set. First let us make the following observation: Suppose I = {1,2}. Let 
S be the set of all functions / : I —» A U B such that /( 1 ) £ A and /( 2 ) 6 B. 
Then every function f £ S defines an ordered pair (/(l),/(2)) £ A x B. 
Conversely, given x £ A and y £ B, define f £ S by /( 1 ) = x and /(2) = y. 
Then the ordered pair ( x,y ) defines a function f £ S. Hence, there is a one- 
one correspondence between the elements of S and A x B. We now define the 
Cartesian product of {A a \ a £ I}. 

Let {A a | a £ 1} be a family of sets. The Cartesian (cross) product 
of {A a | a £ I}, denoted by Flae/ A a , is defined to be the set 

{/!/:/—»■ U aelA a and f(a) £ A a for all a £ /}. 

Let / E Then f(a) £ A a for all a £ I. Let us write /(a) = x a for 

all a £ I. We usually write (x a ) a <zj for /, i.e., a typical member of ria€/ is 
denoted by (x a ) a£ i, where x a £ A a for all a £ I. 

Suppose I = {1,2, ...,n} is a finite set. Then the Cartesian product 
Yiizi is denoted by A\ x A<i x • • • x A n . A typical member of A\ x A 2 x 

• • • x A n is denoted by (x\, 12 , • • •, x n ), X{ £ Ai for all i = 1,2,..., n. The 
elements of A\ x A 2 x • • • x A n are called ordered n-tuples. For two ele¬ 
ments (xi,x 2i ■ ■ ■ ,x n ), (yi,y 2 , ■ ■ • ,2/n) £ A x x A 2 x ■ ■ • x A n , (ar 1 ,x 2 ,..., x n ) = 
{yiAJ 2 , ■ ■ ■ ,Vn) if and only if x x = y % for all i. 


1.5.1 Worked-Out Exercises 

0 Exercise 1 Determine which of the following mappings / : R —> R are 
one-one and which are onto R : 

(i) f(x) = x + 4, 

(ii) f(x) = x 2 
for all x £ R. 
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Solution: (i) Let i,i/eR. Suppose f(x) = f{y). Then i + 4 = i/ + 4 or 
x — y. Hence, / is one-one. Now / is onto R if and only if for all y £ R there 
exists x £ R such that /(x) = y. Let y € R. If f(x) = y ) then x + 4 = y or 
x = y — 4. Also, y — 4 £ R. Thus, we can take x to be y — 4. Now f(y — 4 ) = 
y — 4 + 4 — y. Hence, / is onto R. 

(ii) We note that f(x) is a nonnegative real number for all x £ R. This 
means that negative real numbers have no preimages. In particular, for all 
x £ R, f(x) — x 2 ^ —1. Hence, / is not onto R. Also, /(—1) = 1 — /(1) and 
-1 ^ 1. Thus, / is not one-one. Thus, / is neither one-one nor onto R. 


0 Exercise 2 (i) Let / : Z —> Z be a mapping defined by 


/0) 


i if x is even 
2 x + 1 if x is odd 


for all x £ Z. Find a left inverse of / if one exists. 

(ii) Let / : Z —» Z be the mapping defined by f(x) — \x | + x for all x £ Z. 
Find a right inverse of / if one exists. 


Solution: (i) By Theorem 1.5.18, / has a left inverse if and only if / is 
one-one. Before we attempt to find a left inverse of /, let us first check whether 
/ is one-one or not. Let x,y £ R and /(x) = f(y). By the definition of /, /(x) 
is even if x is even and f{x) is odd if x is odd. Thus, since f(x) — f(y), we have 
both x and y are either even or odd. If x and y are both even then fix) — x 
and f(y) = y and so x — y. Suppose x and y are odd. Then /(x) — 2x +1 and 
/( y ) — 2y T 1. Then 2x -f 1 = 2y + 1 or x = y. Hence, / is one-one and so / has 
a left inverse. Thus, there exists a function g : Z —> Z such that go f = Let 
x £ Z. Suppose x is even. Now x — izix) = (<? o /)(x) = <?(/(x)) = <?(x). This 
means g(x) — x when x is even. Now suppose x is odd. Then x = iz^) — 
(<7 o /)(x) — g(f(x)) — g{2x + 1 ). Put i = 2 x + 1 . Then x — This shows 
that g(x) — if x is odd. Thus, our choice of g is 


9{x) 


x 

x—1 
2 


if x is even 
if x is odd. 


(ii) Note that /(x) = |x| + x > 0 for all x £ Z. This shows that negative 
integers do not belong to T(f). In particular, /(x) 7 ^ —1 for all x £ Z. Thus, 
/ is not onto Z and so / does not have a right inverse. 


<) Exercise 3 Let X and Y be nonempty sets and / : X — > Y. If T C X, then 
f(T) denotes the set {/(x) | x £ T}. f(T) is called the image of T under 
/. Prove that / is one-one if and only if 

f{A n B) = f(A) n /(B) 

for all nonempty subsets A and B of X. 

“Pufuc. 
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Solution: Suppose that / is one-one. Let A and B be nonempty subsets of 
X. Let y E /(AflB). Then y = /(x) for some x E AflB. Hence, y E /(A)Pl/(B). 
Thus, /(A n B) C f(A) n /(B). Now let y E /(A) n /(B). Then y E /(A) and 
y E /(B). Thus, y = /(a) for some a E A and y = f(b ) for some b E B. Since 
/ is one-one and /(a) = /(b), we find that a = b. Thus, y E /(A n B). Hence, 
/(A) n /(B) C /(A n B). Consequently, /(A n B) = /(A) n /(B). 

Conversely, suppose that /(AflB) = /(A)D/(B) for all subsets A and B of 
X. Suppose / is not one-one. Then there exist x,y E X such that fix ) = /(y) 
and x ^ y. Let A = {x} and B = {y}. Since AflB = /(AflB) — /. However, 
/(A) fl /(B) = {/(x)} 7 ^ </. Thus, /(A n B) ^ /(A) fl /(B), a contradiction. 
Hence, / is one-one. 

0 Exercise 4 Let A be a nonempty set and E be an equivalence relation on 
A. Let B = {[x\ | x E A}, i.e., B is the set of all equivalence classes with 
respect to E. Prove that there exists a function / from A onto B. The 
set B is usually denoted by A/E and is called the quotient set of A 
determined by E. 

Solution: Define / : A — » B by /(x) = [x] for all x E A. By the definition 
of /, £>(/) = A. Let x, y E A. Suppose x = y. Then [x] = [y] and so /(x) = /(y). 
Thus, / is well defined. Let [a] E B. Then a E A and /(a) = [a]. Hence, / is 
onto B. 


Exercise 5 Let 5 = {x E R | —1 < x < 1}. Show that R ~ S. 
Solution: Define / : R —> S by 

f( x ) 


x 


1 + X 


for all x E R. Let x E R. Then — |x| < x < |x|, —1 — |x| < — |x| , and 


x| < 1 T |x|. Hence, —1 — |x| < x < 1 + |x|. Thus, —1 < 


1+ X 


< 1 and so 


— 1 < /(x) < 1. This shows that /(x) E S. Let x, y E R and /(x) = /(y). Then 
rfjij = Thus, = yM_. This implies that |x| + |x| |y| = |y| + |x| |y| 

and so |x| = |yj. Now implies that x > 0 if and only if y > 0. 

Therefore, since |x| = |y| , x = y. Thus, / is one-one. 

Now let z E R and — 1 < z < 1. If 0 < 2 < 1, then 


/(t— ) = 


1 —z 


1 — z 


1-Z 


1 + 


1-2 


1 + 


= 2 . 


1—z 


If — 1 < z < 0, then 


/(rb) = 


1+z 


1 + 2 


l + z 


1 + 


1 + Z 


1 + 


= 2 . 


1+z 


Hence, / is onto R. Consequently, R ~ S. 
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1.5.2 Exercises 

1. Determine which of the following mappings / :R->R are one-one and 
which are onto R : 

(i) f(x) = x + 1, 

(ii) f(x) = x 3 , 

(hi) f(x) = \x\+x 
for all x € R. 


2. Consider the function / = {(x,x 2 ) | x € 5} of S — {—3, —2, —1, 0, 1, 2, 
3} into Z. Is / one-one? Is / onto Z? 

3. Let / : R + —* R + and g : R + —* R + be functions defined by fix) = yfx 
and g{x ) = 3:r + 1 for all x 6 R + , where R + is the set of all positive real 
numbers. Find fog and gof.Isfog = gof? 

4. Let / : Q + —> R and g : R —► R be defined by f(x) = 1 + ^ for all 
x 6 Q + and g{x) = x + 1 for all x G R, where Q + is the set of all 
positive rational numbers. Find g o /. 


5. For each of the mappings / : Z —> Z given below, find a left inverse of / 
whenever one exists. 


(i) /0) = x + 2, 


(ii) f{x) = 2x, 


(iii) f( x ) 


| if a: is even 
5 if x is odd 


for all x E Z. 


6. For each of the mappings / : Z 
/ whenever one exists. 

(i) f(x) = x-3, 

(ii) /(:r) = 2 x, 


Z given below, find a right inverse of 


/•••\ e/ \ i x if oc is even 
(ln) f(x) = X + 1 if X is odd 


for all xeZ. 


7. Let A — {1, 2,3}. List all one-one functions from A onto A. 


8. Let A — {1,2,... ,n}. Show that the number of one-one functions of A 
onto A is n\ 

“Puiic. P4y.i.ZcJ. 
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9. Let / : A —> B be a function. Define a relation R on A by for all a,b G A, 
aRb if and only if /(a) = /(&). Show that R is an equivalence relation. 

10. Given / : X —> Y and A, B Cl, prove that 

(i) f(AUB) = f(A)Uf(B), 

(ii) f(AnB) C f(A)nf(B), 

(iii) f(A\B) C f(A)\f(B) if / is one-one. 

11. Given / : X -> Y. Let S C Y. Define /“'(S) = {x 6 X \ f(x) e S}. Let 
A, B C.Y. Prove that 

(i) f- l (AuB) = f-\A)Uf-\B), 

(ii) r 1 (-4nB) = /-‘(A)n/- 1 (fi), 

(iii) f-\A\B) = f-'(A)\f-\B). 

12. Let / : A —► B. Let /* be the inverse relation, i.e., 

/* = {(y,x) £ B x A | f(x) - 2 /}. 

(i) Show by an example that /* need not be a function. 

(ii) Show that f* is a function from Z(/) into A if and only if / is one-one. 

(iii) Show that f* is a function from B into A if and only if / is one-one 
and onto B. 

(iv) Show that if /* is a function from B into A , then / -1 = /*. 

13. Show that Z ~ E, where E is the set of all even integers. 

14. Let A = {xGR|0<(e< 1} and 5 = {xGR|5<r< 8}. Show that 
/ : A —» B defined by f{x) — 5 + (8 — 5)x is a one-one function from A 
onto B. 

15. (i) Show that Z and 3Z are equipollent. 

(ii) Show that 5Z and 7Z are equipollent. 

16. Let S — {x € R | 0 < x < 1}. Show that R + ~ S. 

17. (Schroder-Bernstein) Let A and B be sets. If A ~ Y for some subset 
Y of B and B ~ X for some subset X of A, prove that A ~ B. 

18. Find a one-one mapping from R onto R + . 

19. Is Z ~ Q? 

20. Let i4 = {rGR|0<r<l} and B = {x 6 R | 0 < x < 1}. Is it true 
that A ~ B? 
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21. For each of the following statements, write the proof if the statement is 
true, otherwise give a counterexample. 

(i) A function f : A B is one-one if and only if g o f = h o f for all 
functions g,h : B —» A. 

(ii) A function / : A —» B is one-one if and only if for all subsets C of A, 
f(A\C) 2 B\f(C). 


1.6 Binary Operations 

The concept of a binary operation is very important in abstract algebra. Thro¬ 
ughout the text we will be concerned with sets together with one or more 
binary operations*. In this section, we define binary operations and examine 
their basic properties. 

Definition 1.6.1 Let S be a nonempty set. A binary operation on S is a 
function from, S x S into S. 

For any ordered pair (x,y) of elements x,y £ S, a binary operation assigns 
a third member of S. For example, 4- is a binary operation on Z which assigns 
3 to the pair (2,1). 

If * is a binary operation on S , we write x * y for *(x,y), where x,y £ S'. 
Since the image of * is a subset of S', we say S is closed under *. 

Z is closed under 4- since if we add two integers we obtain an integer. Since 
2,5 € N and 2 — 5 = —3 ^ N, we see that — (subtraction) is not a binary 
operation of N and we say that N is not closed under —. 

Definition 1.6.2 A mathematical system is an ordered (n 4- 1 )-tuple (S, 
where S is a nonempty set and is a binary operation on S', 
i — 1 , 2, ... ,n. S is called the underlying set of the system,. 

Definition 1.6.3 Let (S', *) be a m,athem,atical system,. Then 

(i) * is called associative if for all x, y, z £ S, x * (y * z) = (x * y) * z. 

(ii) * is called commutative if for all x,y £ S,x * y — y * x. 

Example 1.6.4 Consider the m,ath,em,a,tica,l system, (Z,+). Since addition of 
integers is both associative and commutative, 4- is both associative and com¬ 
mutative. 


Example 1.6.5 Let A be a nonempty set. Let S be the set of all functions on 
A, i.e., 

S = {f \ S ■ A —> A}. 

Since composition of functions is a function (Theorem, 1.5.11), (S, o) is a math¬ 
ematical system,. By Theorem, 1.5.13, o is associative. 
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Example 1.6.6 Let M 2 (R) be the set of all 2 x 2 matrices over R, i.e., 

a, 6, c, d (0 R j . 

Lei + denote the usual addition of matrices and ■ denote the usual multiplica¬ 
tion of matrices. Since addition (multiplication) of 2x2 matrices over R is a 
2x2 matrix over R, it follows that + ( • ) is a binary operation on M 2 (R). 
Hence, (M 2 (R),+, •) is a mathematical system:. Note that + is both associative 
and commutative and, ■ is associative, but not commutative. 

The following is an example of a mathematical system for which the binary 
operation is neither associative nor commutative. 

Example 1.6.7 Consider the mathematical system, (Z where — denotes 
the binary operation of subtraction on Z. Then 3 — (2 — 1) = 2 ^ 0 = (3 — 2) — 1 
and so — is not associative. Also, since 3 — 2^2 — 3, — is not commutative. 

A convenient way to define a binary operation on a finite set S is by means 
of an operation or multiplication table. For example, let S = {a, 6, c}. Define 
* on S by the following operation table. 


* 

a 

b 

c 

a 

c 

b 

a 

b 

a 

a 

a 

c 

b 

b 

b 


To determine the element of S assigned to a*b, we look at the intersection 
of the row labeled by a and the column headed by b. We see that a * b = b. 
Note that b * a = a. 

Definition 1.6.8 Let ( S , *) be a mathematical system,. An elem,ent e e S is 
called an identity of ( S , *) if for all x G S, 


M 2 (R) = 


e*x = x = x*e. 


Example 1.6.9 Let S = {e,a,b}. Define * on S by the following multiplication 
table 


* 

e 

a 

b 

e 

e 

a 

b 

a 

a 

a 

a 

b 

b 

a 

a 


We note that e*a = a = a*e, e*b = b = b*e and, e*e = e = e*e. Thus, e 
is an identity of ( S , *). 
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Example 1.6.10 (i) In Example 1.6.5, %a * s an identity element of (S, o). 

0 0 


(ii) In Example 1.6.6, 

system. (M 2 (R), +) and 
system, (M 2 (R),-). 


0 0 

1 0 ' 
0 1 


is an identity element for the mathematical 
is an identity elem.ent, for the mathematical 


Theorem 1.6.11 An identity element, (if it exists) of a m.athemntical system, 
(S', *) is unique. 


Proof. Let e, f be identities of (S , *). Since e is identity, e * a = e for all 
a £ S. Substituting / for a, we get 


e * / = e. (1-4) 

Now / is identity and so a * / = / for all a e S. Substituting e for a we get 

e*f = f. (1.5) 

From Eqs. (1.4) and (1.5), we get e = f. Hence, an identity element (if it 
exists) is unique. ■ 

1.6.1 Worked-Out Exercises 

0 Exercise 1 Which of the following are associative binary operations? 

(i) (Z, *), where x * y = (x + y) — (x • y) for all x,y £ Z. 

(ii) (R, *), where x * y = max(x, y) for all x, y € R. 

(iii) (R, *), where x * y = \x + y\ for all x, y E R. 

Solution: (i) (x * y) * z = ((x + y) — (x • y)) * z — (x + y) — (x ■ y) + z 
— ((x + y)— (x • y)) • z = x + y-\- z — x • y — x ■ z —y ■ z + x ■ y • z. Similarly, 
x*(y*z) = x + y 4- z— x • y — x • z — y ■ z+ x-y ■ z. Thus, (x * y) * z = x * (y * z). 
Hence, * is associative. 

(ii) (x * y) * z = max(x, y) * z = max(max(x, y),z) = max(x, y, z) — max(x, 
max(y, z)) = x * max(y, z) = x * (y * z). Thus, * is associative. 

(iii) (2* (-3)) *6= |2 + (-3)|*6 = 1*6 = |1 + 6| = 7 and 2 * ((-3) * 6) = 
2 * (|(—3) + 6|) = 2 * 3 = |2 + 3| - 5. Hence, (2 * (-3)) * 6 / 2 * ((-3) * 6) and 
so * is not associative. 
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1.6.2 Exercises 

1. Which of the following are associative binary operations? 

(i) (N, *), where x * y = x y for all x, y 6 N. 

(ii) (Z, *), where x * y = x 4- y + 1 for all x, y € Z. 

(iii) (N, *), where x * y ~ gcd(:r, y) for all x, y £ N. 

(iv) (N, *), where x * y — lcm (x,y) for all x,y E N. 

(v) (R, *), where x * y = min(x, y) for all x, y € R. 

(vi) (R, *), where z * y = |x| -f \y\ for all x, y € R. 

2. In Exercise 1, which of the operations are commutative? 

3. In Exercise 1, which mathematical systems have an identity? 
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Carl Friedrich Gauss (1777-1855) 
was born on April 30, 1777, in Brunswick, 
Germany. Gauss is considered to be one of 
the last mathematicians to know everything 
in his subject. 

Gauss’s genius was revealed at a very 
early age. He was able to do long calcula¬ 
tions in his head. He rediscovered the law 
of quadratic reciprocity, related the arithm¬ 
etic-geometric mean to infinite series expan¬ 
sion, and conjectured the prime number the¬ 
orem. Before the age of twenty, he showed 
that a regular polygon of seventeen sides was 
constructive with ruler and compass—an un¬ 
solved problem since Greek times. At the 
age of twenty, he published the first proof of the fundamental theorem of algebra. He 
completed his Ph.D. at the University of Helmstedt, under the supervision of Pfaff, 
when he was twenty-two . 

In 1801, Gauss published his monumental book on number theory, Disquisitiones 
Arithmeticae. In his Disquisitiones , Gauss summarized previous work in a systematic 
way and solved some of the most difficult outstanding questions. He introduced the 
notion of congruence of integers modulo an integer (a = b mod(c)) and extensively 
studied Z n and obtained many of its important properties. He is credited for coining 
the term complex number and the notation i for y/ — 1. He showed that Z[i] is a 
unique factorization domain. In his honor, Z[i] is called the ring of Gaussian integers. 
Disquisitiones laid the foundations of algebraic number theory. Leopold Kronecker 
said, “It is really astonishing to think a single man of such young years was able to 
bring to light such a wealth of results, and above all, to present such a profound and 
well-organized treatment of an entirely new discipline.” 

Besides being a mathematician he was also a physicist and an astronomer. In 
January 1801, a new planet was briefly observed, which the astronomers were unable 
to locate later. Gauss calculated the position of the planet by using a more accurate 
orbit theory than the usual circular approximation. Gauss used a theory based on the 
ellipse. At the end of the year the planet was discovered at the precise location he 
predicted. The methods he developed are still in use. They include the theory of least 
squares. 

He was appointed director of the observatory at Gottingen and remained there for 
forty years. Gauss disliked teaching and preferred his job at the observatory. He usu¬ 
ally rejected students who sought his guidance. However, he did accept students such 
as Dedekind, Dirichlet, Eisenstein, Riemann, and Kummer, who themselves became 
famous mathematicians. Gauss died on February 23, 1855. As E.T. Bell has said, “He 
lives everywhere in mathematics.” 






Chapter 2 

Introduction to Groups 


There are four major sources from which group theory evolved, namely, classical 
algebra, number theory, geometry, and analysis. Classical algebra originated in 
1770 with J.L. Lagrange’s work on polynomial equations. His work appeared 
in a memoir entitled, “Reflexions sur la resolution algebrique des equations.” 
C.F. Gauss is considered the originator of number theory with his work, u Dis- 
quistiones Arithmeticae ,” which was published in 1801. F. Klein’s lecture in 
1872, “A Comparative Review of Recent Researches in Geometry,” dealt with 
the classification of geometry as the study of invariants under groups of trans¬ 
formations. The impact of his lecture was so strong as to allow Klein to be 
considered as the originator of this source of group theory. The originators of 
the analysis source are S. Lie (1874) and H. Poincare and F. Klein (1876). 

2.1 Elementary Properties of Groups 

In this chapter, and in fact in the remainder of the text, we will be concerned 
with mathematical systems. These systems are composed of a nonempty set 
together with binary operations defined on this set so that certain properties 
hold. From these properties, results concerning these systems are derived. This 
axiomatic approach to abstract algebra unifies diverse examples and also strips 
away nonessential ideas. 

Although noted for his geometry, Euclid inspired the use of the axiomatic 
method, which has proved so indispensable in mathematics. His axiomatic 
approach also affected philosophy, where in the 17th century Baruch Spinoza 
laid down (in The Ethics ) an axiomatic system from which he was able to 
prove the existence of God. His proof, of course, depended on his axioms. His 
proof lost its conviction with the emergence of noneuclidean geometries whose 
axioms were as logical and practical as Euclid’s. 

We will be primarily concerned with mathematical systems called groups 
in this chapter. The theory of groups is one of the oldest branches of abstract 
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algebra. The first effective use of groups was in the early nineteenth century 
by A. Cauchy and E. Galois. They used groups to describe the effect of permu¬ 
tations of roots of a polynomial equation. Their use of groups was not based 
on an axiomatic approach. In 1854, A. Cayley gave the first postulates for a 
group. However, his definition was lost sight of. Kronecker again set down the 
axioms for an Abelian group in 1870. H. Weber gave the definition for finite 
groups (in 1882) and the definition for infinite groups in 1883. 

As previously mentioned, the notion of a group arose from the study of 
one-one functions on the set of roots of a polynomial equation. We have seen 
that the set S of all one-one functions from a set X onto itself satisfies the 
following properties: 

(i) Composition of functions, o, is a binary operation on S. 

(ii) For all f,g,h£S,fo(goh) = (fog)oh. 

(iii) There exists i £ S such that f oi — f = i o f for all / € S. 

(iv) For all f € S there exists an element f~ l £ S such that / o = i — 

r 1 of. ' 


These properties lead us to the definition of an abstract group. 


Definition 2.1.1 A group is an ordered pair (G, *), where G is a nonempty 
set and * is a binary operation on G such that the following properties hold: 

(Gl) For all a, b, c £ G, a * (6 * c) = (a * 6) * c ( associative law). 

(G2) There exists e £ G such that for all a £ G, a*e — a — e*a (existence 
of an identity). 

(G3) For all a £ G, there exists b £ G such that a*6 — e — b*a (existence 
of an inverse). 

Thus, a group is a mathematical system (G, *) satisfying axioms Gl to G3. 

In what follows, we will see several examples of groups. However, let us 
first observe the following important properties of groups. 


Theorem 2.1.2 Let (G,*) be a, group. 

(z) There exists a. unique element e £ G such that e*a = a — a * e for all 
aeG. 

(ii) For all a £ G, there exists a unique b £ G such that a*b = e = b*a. 


Proof. (i) By G2, there exists e £ G such that e * a — a = a * e for all 
a £ G. Since (G,*) is a mathematical system, e is unique by Theorem 1.6.11. 

(ii) Let a £ G. By G3, there exists b £ G such that a*b = e = 5*a. Suppose 
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there exists c € G such that a*c = e = c*a. We show that b — c. Now 


b — b * e 
= b * (a * c) 

— (6 * a) * c 
= e * c 

— c. 


(substituting e = a * c) 
(using the associativity of *) 
(since b * a = e) 


Thus, b is unique. ■ 

The unique element e € G that satisfies G2 is called the identity element 
of the group (G, *). Let a £ G. Then the unique element b £ G that satisfies 
G3 is called the inverse of a and is denoted by a" 1 . 

If a group (G, *) has the property that a * b = b * a for all a, b £ G, then 
(G, *) is called a commutative or Abelian group. A group (G, *) is called 
noncommutative if it is not commutative. 


Example 2.1.3 Consider Z, the set of integers, together with the binary op¬ 
eration +, where T is the usual addition. We know that + is associative. Now 
0 £ Z and for all a £ Z, a + 0 = a = 0 + a and so 0 is the identity. Also, for 
all a £ Z, —a £ Z and a + (—a) = 0 = (—a) + a. That is, —a is the inverse 
of a. Hence, it now follows that (Z, +) is a group. Since a + b = b + a for all 
a,b £Z, + is comnmtative. Thus, (Z,+) is a commutative group. 

Similarly, we can show that (Q,+), (R,+), (C,+), (Q\{0},-), (R\{0},-), 
(C\{0}, •) are all examples of commutative groups, where + is the usual ad¬ 
dition and ■ is the usual multiplication. Note that for each of the groups 
(Q\{0}, ■), (R\{0},-), (C\{0},.) the identity el.em.ent is 1. 

Example 2.1.4 Let a be any fixed integer. Let G — {na \ n £ z}. Then (G, +) 
is a commutative group, where + is the usual addition of integers. Note that 
0 — 0 • a and —(na) — (—n)a are members of G. 

Gauss’s work yielded many new directions of research in Abelian groups. 
The next two examples are due to Gauss. 


Example 2.1.5 Consider Z n (Exam,pies 1.3.11 and 1.3.17). Define + n on Z n 
by 

[a] + n [b] = [a + 6] 


for all [a], [b] £ Z n . We show that (Z n ,+ n ) is a commutative group. 

We first prove that + n is a binary operation. Let [a], [6], [c], [d] £ Z n . Sup¬ 
pose [a] = [c] and [6] = [d], Th.enn\(a — c ) and.n\(b — d), i.e., there exist integers 
s andt such that ns — a — c andnt = b — d. Hence, n(s + t) = ((a + b) — (c + d)) 
and so n|((a + 6) — (c + d)). This implies that a + 6 = n c + d. Therefore, 
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[a + 6] = [c+d\. As a result -\- n is well defined and so + n is a binary operation. 
For all [a], [ b ], [c] G Z n , ([a] + n [ b ]) + n [c] = [a + b] + n [c] = [(a + 6) + c] = 
[a + (6 + c)] = [a] + n [6 + c] — [a] + n ([6] + n [c]). Hence, + n is associative. Now 
[0] € Z n and for all [a] € Z n , 

[a] + n [0] = [a + Q] = [a] — [0 + a] = [Oj + n [a]. 

This shows tha,t [0] is the identity element. Also, for all [a] G Z n , [—a] G Z n 
and, 

[a] + n [-a] = [a - a] = [0] = [-a + a] = [-a] + n [aj. 

Thus, [—a] is the inverse of [a]. Finally, for all [a], [6] G Z n 

[°] +n [6] — [a + b] — [b -f a] — [6] + n [a] 

a,nd, so + n is comm,uta,tive. Hence, (Z n ,+ n ) is a commutative group. 

Example 2.1.6 Consider Z n (Examples 1.3.11 and 1.3.17). Define - n on Z n 
by 

[a] •„ [6] = [ab] 

for all [a], [b] G Z n . the help of a little calculation as in Example 2.1.5, 

we ca.n show that • n is a binary operation on Z n and • n is associative. Now 
[1] G Z n and for all [a] G Z n , 

[a] - n [1] = [a • 1] = [a] = [1 • a] = [1] - n [a]. 

77?is implies that [1] is the identity element. We now show that if [a] G Z n and 
[a] ^ [0], then [a] has an inverse if and only i/gcd(a,n) = 1. 

Let [a] G Z n and [a] ^ [0]. Suppose gcd(a,n) = 1. Then there exist b,r G Z 
such tha.t ab P nr = 1 by Theorem, 1.2.11, i.e., ab — 1 = nr. This implies that 
[a&] = [1] or [a] - n [6] = [1]. Since ab = ba , we a/so have [6] - n [a] = [ ba} — [ab] — 
[1]. Thus, there exists [6] G Z n such that [a] [6] = [1] = [6] [a] and so [a] has an 
inverse. Conversely, suppose [a] G Z n , [a] [0] and [a] has an inverse. Then 

there exists [6] G Z n such that [a][6] = [1]. This implies that n\(ab — 1) (by 
Exercise 11, page 30^ and so ab — 1 — nr for som,e r G Z. Thus, ab + nr = 1 
and hence by Theorem, 1.2.11, gcd(a,n) = 1. This proves our claim,. 

Thus, we see that in general, not every elem,ent o/Z n \{[0]} has an inverse. 
For example if n — 6, then the only elem.ents of Zq that have inverses are [1], 
[3] and [5]. Hence, in general (Z n \{[0]}, - n ) is not a group. 

Let U n be the set of all elements of Z n \{[0]} that have an inverse in 
(Z„\{[0]}, n), i-e., 

U n = {[a] G Z n \{[0]}| gcd(a, n) = 1}. 

We ask the reader to verify in Exercise 10 (page 78) that (U n , - n ) is a group. 
Note that for n = 8, Us — {[1], [3], [5], [7]} and for n ~ 7, 

r 7 = {[l],[2],[3],[4],[5],[6]} = Z 7 \{[0]}. 
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Example 2.1.7 Let 


Q[\/2] = {a + bV 2 | a, b E Q}. 

Tften (Q[\/2], +) and (Q[V2]\{0}, •) are commutative groups, where + is the 
usual addition and • is the usual multiplication. The identity of (Q[%/2],+) 
is 0 + 0\/2 and the inverse of a 4- &\/2 is —a + (—6)V2. T/ie identity of 
(Q(v/2]\{0}, •) is 1 = 1 + 0\/2 and the inverse of a + 6\/2 ^ 0 is a 2°2i? ~ 

a i* 2b i 

Example 2.1.8 Let V(X) be the power set of a set X. Consider the operation 
A (symmetric difference, Exercise 6, page 6) on V(X). Then for all A , B 
eV(X), 

AXB = ( A\B ) U (B\A). 

(V(X), A) is a commutative group. The empty set (p is the identity of (fP(X), A) 
and every element ofV(X) is its own inverse. We warn the reader that veri¬ 
fication of the associative law is tedious. 

Example 2.1.9 Let X be a set and Sx the set of all one-one functions of X 
onto X. Since ix, the identity function on X , is one-one and onto X, ix E Sx- 
Thus, Sx 7^ 4>- Let f,g € Sx- Then fog is a one-one function of X onto X by 
Theorem. 1.5.11. Hence, fog £ Sx- By Theorem 1.5.13, o is associative. Also, 
for all f € S x , / _1 G S x and f o f- 1 = i x = / -1 ° f- Consequently, (Sx, °) 
is a group. However, ( Sx ,°) is not necessarily commutative. For example, 
let X = {a,b,c}. Let f,g E Sx be defined by f(a) = b, f(b ) = a, f(c ) = c, 
g(a) = b , g(b) ~ c, g(c) - a. Then (f o g)(b) = f(g(b)) = f(c) - c and 
(g ° f)(b) = g(f(b)) = g(a) = b. Hence, f o g ± g o f. Thus, (S x ,°) is not 
commutative. 


Example 2.1.10 Let GL( 2,R) = 

Define a binary operation * on GL( 2, R) by 


a b 
c d 


a , b, c, d E R, ad — be ^ 0 


a 

c 



u v 
w s 


au + bw av + bs 
cu -f dw cv + ds 


for all 


a b 


c d 

> 


u v 
w s 


E GL( 2,R). This binary operation is the usual 


m.atrix multiplication. Since m,atrix multiplication is associative, we have * is 

1 0 


associative. The element 


0 1 


E GL( 2, R) and is the identity element of 
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GL( 2,R). Let 


a b 
c d 


G GL(2, R). Then ad — be 7 ^ 0. Consider the m,atrix 


d -b 

ad—be ad—be 
— c a 

ad—be ad—be 


. Since 


d a —b —c 

ad — be ad — be ad — be ad — be 


1 

ad — be 


7^0, 


we have 


Now 


and 


Thus, 


d 

ad—be 
—c 

ad—be 


group. Now 


and 


d -b 

ad i)c ad 6c 

— C Q 

ad—be ad—be 


G GL{ 2,R). 


a b 


r d 

-b 1 


’ 1 

0 " 

* 

ad—be 

ad—be 

— 

1 

■T3 

O 

_i 


— c 

a 


0 

1 


ad—be 

ad—be 



r d 

—b 1 


a 

b' 


~ i 

0 

o" 
1 

ad—be 
— c 

ad—be 
a 

* 

— 

ad—be 

ad—be 


c 

a 



-6 

ad—be 
a 

ad—be 


is the inverse of 


a b 
c d 


. Hence, {GL(2, R),*) is a 


’ll" 


1 

O 

t-H 

l_ 

0 1 

5 

1 1 


G GL{ 2, R) 


’ll" 


" 1 0 ' 


' 21 ' 


’ll’ 


’l0’ 


’ 1 1 ’ 

0 1 

* 

1 1 


1 1 

1 2 


1 1 

* 

0 1 


Hence, ( GL(2 , R), *) is a noncommutative group. 


The group in Example 2.1.10 is known as the general linear group of 
degree 2 . 

We now prove some elementary properties of a group in the following the¬ 
orem. 


Theorem 2.1.11 Let (G, *) be a group. 

(i) (a -1 ) -1 = a for all a G G. 

(ii) (a * 6) _1 = b~ l * a ' 1 for all a,b G G. 

(Hi) (Cancellation Law) For all a,b,c G G, if either a*c = 6*c or 
c * a — c * b, then a = b. 

(iv) For all a,b G G, the equations a* x = b and y * a = b have unique 
solutions in G for x and y. 
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Proof. (i) Let a E G. Then a~ l * a = e = a * a -1 and so a is an inverse of 
a -1 . Since the inverse of an element is unique in a group (Theorem 2.1.2) and 
since (a -1 ) -1 denotes the inverse of a -1 , it follows that a — (a^ 1 )' 1 . 

(ii) Let a, b £ G. Then 

(a * b) * (6 _1 * a -1 ) = ((a * 6) * 6 _1 ) * a -1 

= (a * (6 * b ~ 1 )) * a~ l 
= (a * e) * a' 1 
— a * a -1 
= e. 

Similarly, (6 -1 * a' 1 ) * (a * b) — e. Hence, 6 _1 * a~ 1 is an inverse of a*b. Since 
the inverse of an element is unique in a group and since (a * 6) _1 denotes the 
inverse of a * b, it follows that (a * 6) -1 = b~ l * a -1 . 

(iii) Let a, 6, c £ G. Suppose a * c = b * c. Now (a * c) * c~ l — (b * c) * c~ l 
implies that a* (c * c _1 ) = 6 * (c * c -1 ). Hence, a * e — b * e or a = b. Similarly, - 
if c * a = c * b, then a — b. 

(iv) Let a,b £ G. First we consider the equation a*x = b. Now o^ 1 *b £ G. 
Substituting a -1 * b for x in the equation a * x = b, we obtain 

a * (a -1 * b) = (a * a -1 ) * b = e * b = b. 

Thus, a -1 * b is a solution of the equation a * x = b. We now establish the 
uniqueness of the solution. Suppose c is any solution of a * x — b. Then 
a * c = b. Hence, 

c = e * c 

= (a -1 * a) * c (since a -1 * a = e) 

= a" 1 * (a * c) (since * is associative) 

= a -1 * b (since a * c = b). 

This yields the uniqueness of the solution. Similar arguments hold for the 
equation y * a — b. ■ 

Corollary 2 .1.12 Let (G, *) be a group and a £ G. If a* a = a, then a = e. ■ 

Proof. Since a = a * a, we have a* a ~ a* e. By the cancellation law, a = e. 

Corollary 2.1.13 In a multiplication table for a group (G, *), each element 
appears exactly once in each row and. exactly once in each column. 
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Proof. Let b £ G be such that b occurs twice in the row marked by a £ G. 
Then there exists u,v £ G with u ^ v such that a * u = b and a * v — b. 
Thus, the equation a * x — b has two distinct solutions, u and v. This is a 
contradiction to Theorem 2.1.11(iv) since the equation a * x = b has a unique 
solution for x. A similar argument for columns can be used. ■ 

Let (G, *) be a group and a, b, c £ G. Then by the associative law, a*(6*c) = 
(a * b) * c. Hence, we can define a * b * c = a * (6 * c) = (a * 6) * c. Let a, b, c, d £ G. 
Then (a * b * c) * d — (a * (6 * c)) * d = a * ((6 * c) * d)) = a * (b * (c * d)) = 
(o * b) * (c * d) = ((a * b) * c) * d. Thus, there is more than one way of inserting 
parentheses in the expression a * b * c * d to produce a “meaningful product” 
of a,b,c,d (in this order). We now extend this notion to any finite number of 
elements. 

Definition 2.1.14 Let (G, *) be a group and ai, 02 ,..., a n £ G be n elements 
of G (not necessarily distinct). The meaningful product of a 1 , 02 ,. .. , a n (in 
this order) is defined as follows: If n = 1, then the meaningful product is ai. If 
n > 1, then the meaningful product of a\,a 2 ,... ,a n is any product of the form 1 

(ai * ■ • • * a m ) * (a m +i * ■ • ■ * a n ), 

where 1 < m < n and (a\* ••• * a m ) and (a m+ i* * a n ) are meaningful 
products of m and n — m, elements, respectively. 

Definition 2.1.15 Let (G, *) be a group and ai,a 2 ,...,a n £ G, n > 1. The 
standard product of 01 , a 2 ,..., a n denoted by ai* < 22 * ■■■ * a n is defined re¬ 
cursively as 


a\ — a\ 

ai * a 2 * • • • * a n = (ai * 02 * • ■ • * a n _ 1 ) *o n if n > 1. 

In the next theorem, we establish the equality between any meaningful 
product and standard product. 

Theorem 2.1.16 Let (G, *) be a group and ai, 02 ,..., a n £ G, n > 1. Then 
all possible meaningful products of a\, 02 ,..., a n (in this order) are equal to the 
standard product of a\,a 2 , ... ,a n (in this order). 


Proof. We prove the result by induction. If n = 1, then a\ is the only mean¬ 
ingful product of ai, which is equal to the standard product a\ of a\. Thus, the 
result is true if n — 1. Suppose that the theorem is true for all integers m. such 
that 1 < m < n. Let a \, < 22 , ..., a n £ G. Let (ai* ■ • • * a t ) * (fy+i* - • - * a n ) be a 
meaningful product of 01 , ( 22 , .. ■, a n (in this order). Now t < n and n — t < n. If 
t — n — 1, then (a 1 * 0 . 2 * ■ • ‘*at)*at +1 = a\ *( 22 * ■ ■ ■ *a t *a t+ i. Suppose t < n— 1. 


~Ph.y-A.LG-A. 
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Then (ai * • ■ • *a t ) * (at+i* • - ■ *a n ) = (ai* • • • *a, t ) * ((at+i* ■ • • *a n _i) *a n ) = 
((ai* * a t ) * (a i+ i* • • • * a„_i)) * a n = (a\ * a 2 * • • • * a n _i) * = ai* 

• • • * a n since by the induction hypothesis (a-i* * at) * (at+i* • • • * a„_i) = 

a\ * a 2 * • • • * a n -i- Hence, the result is true for n. The result now follows by 
induction. ■ 


We have seen several examples of groups. In order to show that a given 
set with a given binary operation is a group, we need to verify Gl to G3 of 
Definition 2.1.1. However, it would be helpful if we had some criteria that 
could be used to show whether a given set with a binary operation is a group 
or not instead of verifying all the properties G1-G3 explicitly. Partly for this 
reason we define what a semigroup is. Following the examples, we develop some 
results that can be used to test whether a given set with a binary operation is 
a group or not. 

Definition 2.1.IT A semigroup is an ordered pair ( S , *), where S is a nonem¬ 
pty set and * is an associative binary operation on S. 

Thus, a semigroup is a mathematical system with one binary operation 
such that the binary operation is associative. We note that every group (G, *) 
is a semigroup. 

A semigroup ( S , *) is commutative if * is commutative, i.e., a * b = b * a 
for all a,b E S. A semigroup (S, *) which is not commutative is called non- 
commutative. 


Let (S, *) be a semigroup. We say that (S, *) is with identity if the mathe¬ 
matical system (S, *) has an identity. An element a 6 S is called idempotent 
if a * a — a. 


Example 2.1.18 Consider N, the set of positive integers. We know that ad¬ 
dition of positive integers is again a positive integer. Thus, + is a binary 
operation on N. We also know that + is associative a,nd commutative. Thus, 
(N, +) is a commutative semigroup. 


Example 2.1.19 Let X be a nonempty set and S the set of all functions f : 
X —> X. If o denotes the composition of functions, then (S', o) is a semigroup 
with identity. The associativity of o follows from. Theorem, 1.5.13. When X has 
two or m,ore elements, the semigroup (S, o) is noncom,mutative. For example, 
let X — {a, 6}. Let g,h £ S be defined by g(a) = 6, g(b) = 6, h(a) = 6, h(b) = a. 
Then (goh,)(a) = b ^ a = (hog)[a). Therefore, goh ^ hog. Let f £ S be defined 
by f(a) = a and f(b) = a. Now (fog)(x) = f (g(x)) =a = f(h(x)) = (foh)(x) 
for all x £ G. Hence, fog = f oh. But g ^ h. This shows that the cancellation 
la,ws do not hold in S. Thus, (S, o) is not a group. 
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Example 2.1.20 Let X be a set with two or m,ore elements and S' the set of all 
functions f : X —> X which are not one-one. Then (S',o) is a noncommutative 
semigroup without identity. 

Example 2.1.21 LetX be a set andV(X) the power set of X. Then (V(X), U) 
and (V(X),r\) are commutative semigroups with identity. The identity of 
(V(X),\J) is (f) and, the identity of (V(X), Pi) is X. 

The following three theorems give necessary and sufficient conditions for a 
semigroup to be a group. 

Theorem 2.1.22 A semigroup (5,*) is a group if and only if 

(i) there exists e G S such that e * a = a for all a G S and 

(ii) for all a G S there exists b G S such that b * a = e. 

Proof. Suppose (S’, *) is a semigroup that satisfies (i) and (ii). Let a be any 
element of S. Then there exists b G S such that b * a = e by (ii). For b G S, 
there exists c G S such that c * b — e by (ii). Now 

a = e*a = (c*6)*a — c * (6 * a) = c * e 


and 


a*b = (c * e) * b = c * (e * b) = c * b = e. 


Hence, a*b~e = b*a. Also, 


a*e — a*(6*a) = (a*6)*a = e*a — a. 

Thus, a * e = a — e * a. This shows that e is the identity element of S. Now 
since a*b=e = b*a, we have b = a" 1 . Therefore, (S, *) is a group. The 
converse follows from the definition of a group. ■ 

Theorem 2.1.23 A semigroup (S, *) is a group if and only if for all a,b G S 
the equations a * x = b and y * a = b have solutions in S for x and y. 


Proof. Suppose the given equations have solutions in S. Let a G S. Consider 
the equation y * a = a. By our assumption, y * a ~ a has a solution u G S', say. 
Then u * a — a. Let b be any element of S. Consider the equation a * x = b. 
Again by our assumption, a*x = b has a solution in S. Let c G S be a solution 
of a * x = b. Then a * c = 6. Now 


u * b 


u * (o * c) 
(u * a) * c 
a * c 


b. 


(since b = a * c) 

(since * is asociative) 
(since u * a = a) 



2.1. ELEMENTARY PROPERTIES OF GROUPS 


67 


Since b was an arbitrary element of S, we find that u * b = b for all b £ S. 
Thus, (S',*) satisfies (i) of Theorem 2 . 1 . 22 . Consider the equation y * a = u. 
Let d £ S be a solution of y * a = u. Then d * a = u. This shows that (S', *) 
satisfies (ii) of Theorem 2.1.22. Hence, (S, *) is a group by Theorem 2 . 1 . 22 . 

The converse follows by Theorem 2.1.11(iv). ■ 

Theorem 2.1.24 A finite semigroup (S',*) is a group if and only if (S, *) 
satisfies the cancellation laws (i.e., a * c = b * c implies a = b and c * a — c * 6 
implies a = b for all a, 6 , c £ S). 

Proof. Let (S, *) be a finite semigroup satisfying the cancellation laws. Let 
a, b £ S. Consider the equation a * x = b. We show that this equation has a 
solution in S. Let us write S ~ {ai, ai, ..., a n }, where the afi s are all distinct 
elements of S. Since S is a semigroup, a * a* £ S for all i = 1 ,2,..., n. Thus, 
{a*ai, a*a 2 , ..., a*a n } C S. Suppose a*a* — a*Oj for some i 7 ^ j. Then by the 
cancellation law we have = aj, which is a contradiction since a* 7 ^ aj. Hence, 
all elements in {a * a\, a * 02 , ..., a * a n } are distinct. Thus, S — {o * ai, a * 02 , 
..., a*a n }. Let b € S. Then b = a *for some £ S. Therefore, the equation 
a*x = b has a solution in S'. Similarly, we can show that the equation y * o — 6 
has a solution in S. Hence, by Theorem 2.1.23, (S', *) is a group. The converse 
follows by Theorem 2.1.11 (iii). ■ 

Let (G, *) be a group, a £ G, and n £ Z. We now define the integral 
power a n of a as follows: 


e 

a * a n ~ l if n > 0 
{a~ l )~ n if n < 0 . 

Note that a n — (a ~ n ) -1 if n < 0. In the exercises at the end of this section, 
we ask the reader to verify certain basic properties of integral powers. It should 
be pointed out that when we use additive notation for the binary operation *, 
we speak of multiples of an element a of the group (G,+), which are defined 
as follows: 

0 a — 0 , where the 0 on the right-hand side denotes the identity of the 
group (G, +) and the 0 on the left-hand side denotes the integer 0 . 
na — al(n-l)o if n > 0 

na — (—n)(— a) if n < 0 . 

For example, in (Z 6 ,+ 6 ), 2[3] = [3] +6 [3] = [ 6 ] = [0]. By the notation na, 
we do not mean n and a multiplied together since no multiplicative operation 
between elements of Z and G has been defined. 

“Pi l*ic- 
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Definition 2.1.25 A group (G,*) is coiled a finite group if G has only a 
finite number of elements. The order, written |G|, of a group [G, *) is the 
number of elements of G. 

Example 2.1.5 shows that for every positive integer n, there is a commuta¬ 
tive group of order n. 

The groups in Examples 2.1.5 and 2.1.6 are finite groups. 

A group with an infinite number of elements is referred to as an infinite 
group. Klein and Lie’s use of groups in geometry influenced the turn from 
finite groups to infinite groups. 

The groups in Examples 2.1.3, 2.1.4, and 2.1.7 are infinite groups. 

Let G be a finite group and a <G G. Now a 2 = a * a € G and by induction, 
we can show that a m E G for all m, > 1. Thus, {a, a 2 ,..., a m ,.. .} C G. Since 
G is finite, all elements of the set {a, a 2 , ..., a m , ...} cannot be distinct. Hence, 
a k = a 1 for some positive integers k,l, k > l. This implies that a k ~ l — e. Let 
us write n = k — l. Therefore, a n = e for some positive integer n. Also, if G is 
an infinite group and a E G, then it may still be possible that a n = e for some 
positive integer n. This leads us to the following definition. 

Definition 2.1.26 Let ((7,*) be a, group and, a 6 G. If there exists a positive 
integer n such that a n = e, then the smallest such positive integer is called, the 
order of a. If no such positive integer n exists, then we say that a is of infinite 
order. 


We denote the order of an element a of a group ( G , *) by o(a). 

The concept of the order of an element is very important in group theory. 
We shall see in later chapters how effectively information about the order of 
an element of a group reveals the nature of the group and in several instances 
leads us to determine the structure of the group itself: 

Example 2.1.27 Consider the group (Z 6 ,+ 6 )- Ze has order 6. The elements 
[0], [1], [2], [3], [4], [5] ha,ve orders 1, 6 , 3, 2, 3, 6 , respectively. For example 
2[3] — [3] +6 [3] — [ 6 ] — [0] and 2 is the smallest positive integer n such that 
n[3] = [0], 

Let G be a group and a E G. If o(a) is infinite, then by the definition of 
the order of an element it follows that o (a k ) is also infinite for all k > 1 , i.e., 
the order of every positive power of a is also infinite. If o(a) is finite, then the 
next theorem tells us how to compute the order of various powers of a. 


Theorem 2.1.28 Let (G,*) be a group a.nd a be an elem.ent of G such that 
o (a) = n. 

(?) If a m = e for som,e positive integer m, then n divides m. 

“Pi mJlc. Ma.£/Le#fta£lea./ ~Pi±y.£-LG-A- 
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(u) For every positive integer t , 

0(a,) = ssfco- 

-Proof. (i) By the division algorithm, there exist p,q E Z such that m = nq+ 
r, where 0 < r < n. Now a T = a Tn ~ nq = a m *a“ n<J = a m *(a n ) _<? — e*(e) _l? = e. 
Since n is the smallest positive integer such that a n = e and a r = e, it follows 
that r = 0. Thus, m. = nq. This implies that n divides m. 

(ii) Let o(a 4 ) = k. Then a kt = e. By (i), n divides kt. Thus, there exists 
r E Z such that kt = nr. Let gcd (t,n) = d. Then there exist integers u and 
v such that t — du and n = dv and gcd(u,u) = 1 by Exercise 9 (page 20). 
Now kt = nr implies that kdu = dvr. Hence, ku = rv. Thus, v divides ku. 

Tl. Tit 77 11 

Since gcd(u,u) — l,v divides k. Thus, ^ divides k. Now (a 4 )^ = a~d = a~ — 
a nu = (a n ) u = e u = e. Since 0 ( 0 ^) = k, k divides Since k and ^ are positive 
integers, k = §. Hence, o(o‘) = k = § = ■ 

A group (G, *) is called a torsion group if every element of G is of finite 
order. If every nonidentity element of G is of infinite order, then G is called a 

torsion-free group. 

The group of Example 2.1.27 is a torsion group. The groups (R, +), (R + , •), 
(Q + , ■) are torsion-free groups. The group (R\{0}, ■) is neither a torsion group 
nor a torsion-free group, since —1 is of order 2 and all other nonidentity ele¬ 
ments are of infinite order. 

We close this chapter with the following example. The ideas set forth in 
this example are due to Klein. 

Example 2.1.29 Im,agine a square having its sides parallel to the axes of a 
coordinate system and its center at the origin. 
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We label the vertices as in the figure and we allow the following rigid mo¬ 
tions of the square: clockwise rotations of the square about the center and 
through angles of 90°, 180°, 270°, 360°, say, rg q, H80) r 270, r 360, respectively; 
reflections h a,nd, v a,bout the horizontal and. vertical axes; reflections d 2 
about the diagonals. The following figures should prove helpful. 



A multiplication * on two rigid motions can be defined by performing two 
such motions in succession. For exam,pie, rgg*h is determined by first perform¬ 
ing motion h and then the m,otion rgo. We see that rgo *h, = d\. The complete 
multiplication table for the operation * follows. 


* 

7*360 

7*90 

7*180 

7*270 

h 

V 

d\ 

d2 

7*360 

7*360 

7*90 

7*180 

7*270 

h 

V 

d\ 

d2 

7*90 

7*90 

7*180 

7*270 

7*360 

d\ 

d2 

V 

h 

7*1 80 

7*180 

7*270 

7*360 

7*90 

V 

h 

d 2 

di 

7*270 

7*270 

7*360 

7*90 

7*180 

d2 

di 

h 

V 

h 

h 

d 2 

V 

di 

7*360 

7*180 

7*270 

7*90 

V 

V 

di 

h 

d 2 

7*180 

7*360 

7*90 

7*270 

di 

di 

h 

d2 

V 

7*90 

7*270 

7*360 

7*180 

d2 

d2 

V 

di 

h 

7*270 

7*90 

7*180 

7*360 


( 2 . 1 ) 


We leave it for the reader to verify that the set of rigid motions is a group 
under the operation *. This group is known as the group of symmetries of 
the square. Let us denote this group by Sym. Then 


Sym, = {r 3 60T90Ti80,r270,h,v,di,d 2 }. 

WiLfuc. 7^o^Lc#fta£uiaZ 
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Since h * r 2 7 o = d\ 7 ^ d 2 = r 2 7 o * h, we see that the Sym, is noncommutative. 
We also note that 7*360 is the identity element. 

Let us now determine the order of the elem.ents. Consider r 9 o. Now 7 g 0 = 
7‘90 * ^90 = 7*180, r 90 = r 90 * r 90 = ^270, a,nd r§ 0 = r| 0 * r 90 = r 360 . Thus, 
°( r 9 o) = 4. Similarly, o(n 80 ) = o(r| 0 ) = gcd 4 42) (by Theorem. 2.1.28) = \ = 2, 
°(r 2 7 o) = 4, o(h) = 2, o(v) = 2, o(d x ) = 2, and o(d 2 ) “ 2. 

Let us write a = r 9 o and (3 = d 2 . Then a 2 = rigo, a 3 = r 2 7 o, = 7*360, 

j3* a = v, f3 * a 2 = c?i, and (3 * a 3 = h. Also, note that (3 * a = a -1 * (3 = a 3 * (3. 
Thus, we see that 

Sym, = {e, a, a 2 , a 3 ,(3,(3 * a, (3 * a 2 , (3 * a 3 }. 

Finally, we make the following observations. Consider vqq. We can think of 
rgo as a one-one function of {1,2, 3,4} onto {1,2, 3,4} by defining rgo(l) = 2, 
r 9 o(2) = 3, r 9 o(3) = 4, r 9 o(4) — 1. In a similar manner, we can consider other 
rigid motions of the square as one-one functions of {1,2,3,4} onto {1,2, 3,4}. 

A fundamental phenomenon of nature is that of symmetry. A figure or an 
object is said to have a symmetry if a rotation, a translation, an inversion, 
a minor reflection, or a combination of these operations leaves the figure or 
object indistinguishable from its original position. The 1890s saw the first 
application of group theory to the natural and physical sciences. An important 
application of group theory was to crystallography. Groups were used to give 
a theoretical classification of the different kinds of symmetry arrangements 
possible within crystalline matter 20 years before experimental means were 
available for analyzing the crystals themselves. 

Group theory is used in quantum mechanics. It is used to study the atom’s 
internal structure. In the 1950s, a new generation of particle accelerators pro¬ 
duced a variety of subatomic particles. Group theory was used to predict the 
existence of a tenth nucleon in a tenfold symmetry scheme of nucleons of which 
nine particles had already been detected. In 1964, the tracks of Omega-Minus, 
the tenth nucleon, were identified. 

2.1.1 Worked-Out Exercises 

0 Exercise 1 Let G = {a £ R | — 1 < a < 1}. Define a binary operation * on 
G by 

L a + b 

a*b = — - - 

1 + ab 

for all a,b G G. Show that (G, *) is a group. 

Solution: Note that — 1 < x < 1 if and only if x 2 < 1 for all x 6 R. 
Let a,b € G. First we show that a * b e G. Now a 2 < 1 and b 2 < 1. Thus, 

“PuJic. TfeatAewtottca/ 
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(1 — a 2 )( 1 — 6 2 ) > 0. This implies that 1 — a 2 — 6 2 + a 2 6 2 > 0. Now (1 + a6) 2 — 
(a + 6) 2 = 1 + a 2 6 2 + 2 ab — a 2 — 6 2 — 2 ab = 1 — a 2 — 6 2 T a 2 6 2 > 0 and so 
Therefore, a * 6 E (7. Hence, G is closed under *. We now show 
that * is well defined. Let a, 6, c, d E G and (a, 6) = (c, g0. Then a — c and 
6 — d. Thus, 


a * b = 


a + b 
1 + ab 


c + d 
1 + cd 


c * d 


and so * is well defined. To show that * is associative, let a, 6, c E G. Now 


(a * 6) * c = 


a + b 
1 -|- a6 


a+b 

l+a.6 


+ C 


i + ( 


l+a6 


) c 


a T & T c ~f~ a6c 
1 T db T dc T be 


Similarly, 


d * (6 * c) 


a + 6 + c+ dbc 
1 + a6 + ac + 6c 


Therefore, (a * 6) * c = a * (6 * c) and so * is associative. Hence, we have shown 
that (G, *) is a semigroup. Now 0 6 G and 


0 * a — 


0 T d 
1 + 0a 


— a 


for all d E G. 


This shows that (G, *) satisfies (i) of Theorem 2.1.22. Let a 6 G. Then —a E G 
and 


Thus, (G, *) satisfies (ii) of Theorem 2.1.22. Consequently, by Theorem 2.1.22, 
(G, *) is a group. 


0 Exercise 2 Let G — {(a, 6) | a, 6 E R, a ^ 0} = R\{0} x R. Define a 
binary operation * on G by 

(a, 6) * (c, d) = (ac, 6 + d) 

for all (a, 6), (c,d) E G. Show that 

(i) (G, *) is a group, 

(ii) G has exactly one element of order 2, 

(iii) G has no elements of order 3. 


Solution: (i) As in Worked-Out Exercise 1, we show that (G, *) satisfies 
the conditions of Theorem 2.1.22. Let (a, 6), (c, d) E G. Then a ^ 0 and c ^ 0 
and so ac ^ 0. Thus, (a, 6) * (c,d) = (ac, 6 -T d) E G. Hence, G is closed under 
*. It is a direct computation to verify that * is well defined and associative, so 
we ask the reader to do the verification. Now (1,0) E G and 

(1,0) * (a, 6) = (la, 0 + 6) — (a, 6) for all (a, 6) E G 

~Pi lHx± 7feaiAe»tatLca/ 
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and so ( G , *) satisfies (i) of Theorem 2.1.22. Let (a, 6) £ G. Then a ^ 0 and so 
\ £ R and ^ ^ 0. Thus, (^, —6) £ (7 and 

(-, -b) * (a, 6) = (-a, -6 + 6) = (1,0). 
a a 

Therefore, ( G , *) satisfies (ii) of Theorem 2.1.22. Hence, (G, *) is a group by 
Theorem 2.1.22. 

(ii) First note that (—1, 0) £ G and (—1, 0) * (—1,0) = (1, 0). Thus, ( — 1,0) 
is of order 2. We now show that this is the only element of order 2 by showing 
that if (a, 6) is any other element of G of order 2, then (a, b) = (—1,0). 

Let (a, 6) £ G be an element of order 2. Then (a, b) * (a, b) = (1,0) implies 
that (a 2 , b + b) = (1,0). Therefore, a 2 — 1 and 6 = 0. Now a 2 = 1 implies that 
a — ±1. If a = 1, then (a, 6) = (1,0), which is a contradiction since (1,0) is 
of order 1. Hence, a = — 1 and so (a,6) = (—1,0). Thus, (-1,0) is the only 
element of order 2. 

(iii) Suppose that (a, 6) is an element of order 3. Then (a, 6) * (a, 6) * (a, 6) = 
(1,0). This implies that (a 3 ,36) = (1,0). Thus, a 3 = 1 and 6 = 0. Now a 3 = 1 
implies that a = 1. Hence, (a, 6) = (1,0). But (1, 0) is of order 1. Consequently, 
G has no element of order 3. 

<0 Exercise 3 Let G be the set of all rational numbers except —1. Show that 
(' G , *) is a group where 


a*6 = a + 6 + a6 


for all a, 6 £ G. 


Solution: As in Worked-Out Exercise 1, we show that (G, *) satisfies 
the conditions of Theorem 2.1.22. Our first step is to show that * is well 
defined. Let a, 6, c, d £ G and (a, 6) = (c, d). Then a = c and 6 = d. Thus, 
a*b = a + b + ab = c-\-d + cd = c*d and so * is well defined. Let a,6 £ G. 
Then a / -1 and 6 ^ —1. We now show that a * 6 £ G by showing that 
a * 6 ^ — 1 and a * 6 is a rational number. Suppose a*6 = a + 6 + a6= —1. Then 
(a+ 1)(6+ 1) = 0. Hence, either (a + 1) = 0 or (6+ 1) =0 and so either a = — 1 
or 6 = — 1, which is a contradiction. Therefore, a * b ^ —1. Since addition and 
multiplication of rational numbers is a rational number, it follows that a * 6 is 
a rational number. Hence, a * 6 £ G. Thus, * is a binary operation on G. Let 
a, 6, c £ G. Then 


(a * 6) * c 


(a + 6 + ab ) * c 
a + 6 + a6 + c + acT6c-j- abc 


a + (6 + c + 6c) + a(6 + c + 6c) 
a + 6 * c + a(b * c) 


a * (6 * 
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This shows that * is associative. Thus, (G, *) is a semigroup. Now 0 £ G and 
0*a = O+a+O-a = a for all a G G. Hence, (G, *) satisfies (i) of Theorem 2.1.22. 
Now for all a 6 G, a + 1 ^ 0. Note that — 7= — 1. Therefore, — ^py € G and 


a 

< 2+1 


* a — 


a 


< 2+1 


+ O + (■ 


a 


a + 1 


)a = 


—a + a + a 2 — a 2 
( 2+1 


= 0 . 


This implies that (G, *) satisfies (ii) of Theorem 2.1.22. Hence, by Theorem 
2.1.22, (G, *) is a group. 


-0 Exercise 4 Let G be a group and x € G. Suppose o(x) = mn, where m 
and n are relatively prime. Show that there exist y, z £ G such that 
x = y*z — z*y and o(y) — m and 0 ( 2 ) — n. 


Solution: Since gcd(m, n) — 1 there exist s,t 6 Z such that 1 = ms + nt. 
Now x = x m5+nt = x ms *:r ni . Let 7 / - x nt and 2 = x ms . Then x = y*z = z*y. 
Now y m = (x nt ) rn = x™^ = e. Hence, o(y) divides m. Similarly, 0 ( 2 ) divides n. 
Suppose o(y) = 7721 and 0 ( 2 ) = 721 . It is an easy exercise to verify that (y*z) 1 = 
y l * z l for all positive integers l. Thus, x mini = (y * z) mini = y 7711711 * 2 rnini — 
e * e — e. Hence, 77272 1mi 721 . But since m,i|m and 721 |n, we must have m, = mi 
and 72 — 721 . 


0 Exercise 5 Let (G, *) be a group of even order. Show that there exists 
a £ G such that a ^ e, a? — e. 

Solution: Let A = {g £ G \ g ^ g -1 } C G. Then e £ A. If g G A, then 
g~ l G A, i.e., elements of A occurs in pairs. Therefore, the number of elements 
in A is even. This implies that the number of elements in {e} U A is odd. Since 
the number of elements in G is even and {e} U A C G, there exists <2 G G such 
that a ^ {e} U A. But then a/e and a £ A. Hence, there exists a G G such 
that o/e and a = a^ 1 or a 2 — e. 

0 Exercise 6 Let (G, *) be a group and a, b G G. Suppose that a*b = b*a~ 1 
and b * a = a * 6 -1 . Show that a 4 = 6 4 = e. 

Solution: Since a *b — b* a -1 , a = b*a~ 1 *6~ 1 . Similarly, b = a*6 _1 *a~ 1 . 
Thus, 6*a = a* 6 -1 = (6*a~ 1 *6 _1 ) *6 -1 — 6*a -1 *6~ 2 . Multiply both sides of 
the equation 6 * a = b * a -1 * b~ 2 by b _1 to get a = a -1 * b~ 2 . This implies that 
a 2 = b -2 . Hence, a 4 = a 2 *a 2 = a 2 *b~ 2 = a* (a*b _1 ) *b _1 = a* (b*a) *b _1 — 
(a * 6) * <2 * b -1 = (6 * a -1 ) * a * b -1 = b * (a -1 * a) * b _1 = b* e* M 1 — e. Also, 
6 4 = a -4 = e. 

Exercise 7 Let (G, *) be a group and a, b G G. Suppose that <2 * b n = b n+1 * a 
and b* a n = a n+l * b for some n G N. Show that a = b = e. 
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Solution: Multiply both sides of the equation a * b n = b n+l * a by b~ n 
to get a = b n+l * a * b~ n . Thus, a 2 = a * a = a* b n+l * a * b~ n = (a* 
b n ) * b * a * b~ n = (b n+1 * a) * 6 * a * b~ n = 6 n+1 * (a * 6) * a * 6 _n . Now 
a 3 = a * a 2 = a * (5 n+1 * (a * 6) * a * b ~ n ) = (a * 6 n ) *6*(a*6)*a* 5 -n — 
(6 n+1 * a) * 6 * (a * 6) * a * b~ n — b n+1 * (a * b) 2 * a * b~ n . Hence, we see that we 
could use induction to obtain 


a n = b n+l * (a * b) n ~ l *a*b~ n (2.2) 

for all n £ N. Also, 

b * a n — a 77-1-1 * 6 

— a * a n * b 

= a * (6 n+1 * (a * 6) 71-1 * a * b ~ n ) * 6 

— a * 6 n+1 * (a * 5) n_1 * a * 6 1_n 

= (a * 6 n ) * b * (a * 6) n_1 * a * 6 1_n 
_ (^ n +! * a) * b * (a * 6) n “ 1 * a * 6 1-71 

= 6 n+1 * (a * 6)” * a * 6 1-n , 

which implies that 

a n = 6 n * (a * f>) n * a * 6 1-n . (2.3) 

From Eqs. (2.2) and (2.3), 

b n+l * (a * 6) n_1 * a * b~ n = b n * (a * 6) 71 * a * 6 1 “ n , 


which implies that 


b * (a * 6) n 1 * a = (a * 6) n * a * b = (a * 6) 


71+1 


Thus, 

(a * 6) n+1 = b * (a * 6) n_1 * a. 

= 6* ((a * 6) * • * • * (a * 6)) *a 

v-v-' 

n —1 times 

= (6 * a) * • * ■ * (6 * a) 

^-vr-" 

n times 

— (6*a) n . 

Interchange the role of a and b to get 

(6*a) n+1 - (a * b) n . 


(2.4) 


(2.5) 


Hence, (a * b) n = (b * a) n+1 = (5 * a) n * (6 * a) = (a * 6) n+1 * (6 * a) and so 
e— (a * 6) * (b * a), which implies that 


a 


= b~ 2 . (2.6) 
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Now 

b* a n = 6 * a 2 * a n ~ 2 — b * b~ 2 * a n ~ 2 = b^ 1 * a n ~ 2 (2.7) 

and 

a n+1 *b = a n ~ l * a 2 * b = a n_1 * 6' 2 * b = a 71 " 1 * 6 _1 . (2.8) 

Thus, from Eqs. (2.7) and (2.8) it follows that 6 -1 * a n ~ 2 = a n_1 * 6 -1 and so 

a n-i = b -i * a n-2 * 6 = ( 6 -i * a * (2.9) 


Now b * a n = a n+1 * 6 implies that 


a n = (b 1 * a * &) n+1 . 

(2.10) 

Hence, a n = (b^ 1 *a*b) n+1 = (b"' 1 *a*b) n ~ 2 ^ib -1 xaxb) 2, = a 
which implies that a = (6 _1 *a*&) 3 = & _1 *a 3 * b. Thus, a 3 *6 = 
b*a=a?*b = a* a 2 *b = a* b~ 2 * b = a * 6 _1 by Eq. (2.6). 

n_1 * (& _1 *a*6) 3 , 
= b*a. Therefore, 
That is, we have 

b * a = a * b~ l . 

(2.11) 

Similarly, 

a * b = b * a -1 . 

(242) 


Now a * & = 6 * a 1 implies that a * b * a = b. Thus, b = a*b*a=a*a*b 1 
[by Eq. (2.11)]. Hence, 

2 l2 

a = o . 


Suppose n is even. Then a 2 = b 2 implies that a n = b n . Hence, a*b n = b n+l * a 
implies that a n+l = a n *b* a and so b = e. Similarly, a = e. Suppose n is odd. 
Let n = 2k + 1. Then a 2k = b 2k . Now a* b n = b n+l * a => a * & 2fc+1 = & 2 * +2 * a 
=4* a * a 2k * 6 = a 2A:+2 * a. Thus, b = a 2 = b 2 . Hence, b = e. Similarly, a = e. 

Exercise 8 (Hays) Let (S', *) be a semigroup. Show that S is a group if and 
only if for all a £ S there exists a unique 6 £ S such that a * 6 * a = a. 


Solution: Suppose for all o £ 5, there exists a unique b £ S such that 
a * b * a = a. Let a £ S. Then there exists b £ S such that a * b * a = a. Thus, 
a*b*a*6 = a*b and so (a * b) 2 = a * 6. Hence, S has an idempotent element. 
If (S', *) is to be a group, then it can have only one idempotent (Corollary 
2.1.12), namely, the identity element. Therefore, first we show that S has only 
one idempotent. 


Suppose e and / are two idempotents in S. Since e * f £ S, there exists a 
unique g such that (e * /) * g * (e * /) = e * /. Now (e * /) * (g * e) * (e * /) = 
(e * /) * # * (e * e) * / = (e * /) * g * e 2 * / = (e * /) * g * (e * /) = e * /. Since 
g is unique such that (e * f) * g * (e * f) = (e * /), it follows that g * e = g. 


Similarly, since (e * f) * (f * g) * (e * f) = (e * /) * g * (e * /) — e * /, the 
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uniqueness of g implies that f * g = g. Also, (e * /) * (p * (e * /) * p) * (e * /) = 
((e * /) * g * (e * /)) * g * (e * f) = (e * /) * <7 * (e * /). Again, the uniqueness 
of g implies that g * (e * f) * g = g. Hence, g 2 = g * g = (g * e) * (f * g) = 
g*(e*f)*g = g. Thus, g is an idempotent. Now g = g*g*g and g*(e* f)*g = g. 
Hence, by the uniqueness of the middle element g = e* f. Therefore, e * / is an 
idempotent. Now (e */)*/* (e * /) = (e * (/ * /)) * (e * /) = (e * /) * (e * /) = 
e * / and similarly (e * /) * e * (e * /) = e * /. By the uniqueness of the middle 
element, it follows that e = /. Hence, 5 has a unique idempotent element. 

Let e be the idempotent element of S. Let a 6 S. Then there exists b € S 
such that a * b * a = a, which implies that (a * 6) 2 = a * 6. Hence, a * b = e. 
Also, a * b * a = a implies that b*a*b*a = b*a. Thus, b * a is an idempotent. 
Hence, b * a = e. Also, a *b * a = a together with a*6 = e = 6*a implies that 
e* a — a = a* e. Therefore, e is the identity element. Since a*6 = e = 6*a, b 
is an inverse of a. Consequently, (5, *) is a group. 

Conversely, suppose (S', *) is a group. Let a E S. Note that a * a -1 * a = a. 
This shows the existence of an element b € S' such that a * 6 * a = a , namely, b = 
a -1 . To show the uniqueness, suppose there exist 6 ,c £ S such that < 2 * 6 *o = a 
and a * c * a = a. Then a* 6 *a = a*c*a and by the cancellation laws, b — c. 
Thus, b is unique such that a * b * a = a. 


2.1.2 Exercises 

1. Which of the following mathematical systems are semigroups? Which are 
groups? 

(i) (N, *), where a * b = a for all a, b € N. 

(ii) (Z, *), where a * 6 = a — b for all a, b G Z. 

(iii) (R, *), where a *b ~ \a\b for all a, b E R. 

(iv) (R, *), where a * 6 = a + 6 + 1 for all a, b € R. 

(v) (R, *), where a*b = a + b — ab for all a, b € R. 

(vi) (Q,*), where a * b = y for all a, 6 G Q. 

(vii) (C7, *), where 


<? = 


< 


a b 
—b a 


a b 
—b a 


0 0 
0 0 


and a.bgR 


and * is the usual matrix multiplication. 

(viii) (G, *), where G is the set of all matrices of the following form over 

Z 

1 a 6 
0 1 c 

0 0 1 J 
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and * is the usual matrix multiplication. 


2. Let G — {(a, b) | a, b E R, b ^ 0}. Define a binary operation * on G by 
(a, b ) * (c, d) — (a + be, bd ) for all (a, b), (c, d) E G. Show that (G, *) is a 
noncommutative group. 


3. Let G ~- 


a b 
c d 


a,b,c,d £ R, ad — be = 1 > . Show that G is a group 


under usual matrix multiplication. (This group is usually denoted by 
SL(2 , R) and is called the special linear group of degree 2.) 


4. Let G - 


1 n 
0 1 


n E Z > . Show that ( G , *) is a commutative group, 


where * denotes the usual matrix multiplication. Also, show that (G, *) 
is torsion-free. 


5. In Z14, find the smallest positive integer n such that n[6] = [0]. 

6. Find an element [6] E Zg such that [8] -g [b\ — [1]. Does [b] E Ug? 

7. In I/24, find the smallest positive integer n such that [7] n = [1]. 

8. Describe Uq,Uq,U\ 2 , U 24 of Example 2.1.6. 

9. Let p be a prime. Show that U p = Z p \{[0]}. 

10. Let U n = {[a] E Z n \{[0]}| gcd(a,n) = 1}. Show that (U n , • n ) is a group, 
where - n is multiplication modulo n. 

11. Show that U n = {[a] E Z n \{[0]} | additive order of [a] = n }. 

12. Let (G, *) be a group and a, b E G. Suppose that a 2 = e and a*b A *a = b 7 . 
Show that 6 33 = e. 

13. Let (G, *) be a group and a, b E G. Suppose that a -1 * b 2 * a —(P and 
6 -1 * a 2 * b — a 3 . Show that a = b = e. 

14. Let (G, *) be a group. If a, b E G are such that a 4 = e and a 2 * b = b * a, 
show that a = e. 

15. Let (G, *) be a group and x, a, b E G. Let c = a;*a*:r -1 and d = x*b*x~ 1 . 
Show that a*6 = 6 *aif and only if c* d = d* c. 

16. Let (G, *) be a group such that a 2 — e for all a E G. Show that G is 
commutative. 

17. Prove that a group (G, *) is commutative if and only if 
for all a, b E G. 
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18. Let (G, *) be a group. Prove that if (a * b) 2 = a 2 * b 2 for all a, b E G, then 
(G, *) is commutative. 

19. Prove that a group (G, *) is commutative if and only if for all a, b £ G, 
(a * b) n = a n * b n for any three consecutive integers n. 

20. Let (G, *) be a group. If G has only two elements, prove that G is 
commutative. 

21. Let (G, *) be a group and a, b, c €1 G. Find an element x G G such that 
a * x * b = c. Is 2 : unique? 

22. Let ( G , *) be a group and a,b 6 G. Show that (o * b* a~ 1 ) n — a*b n * a~ l 
for all integers n. 

23. Let (G, *) be a finite group and a 6 G. Show that there exists n € N 
such that a n = e. 

24. If (G, *) is a group and ai, ..., a n £ G, prove that (a* * ■ • • * a„) _1 = 
a" 1 

25. Let (G, *) and (if, •) be groups. Define the operation * on G x H — {(a, 6) 

| a 6 G, 6 6 77} by (a, 5) * (c, d) = (a * c, b ■ d). Prove that (G x 77, *) is 
a group. If (G, *) and (if, ■) are commutative, prove that (G x if,*-) is 
commutative. The group (G x H, *-) is called the direct product of G 
and if. 

26. Let (G, *) be a finite group and a E G. Show that o(a) < |G|. 

27. Let (G, *) be a group and a, 6 6 G. 

(i) Show that a and a -1 have the same order. 

(ii) Show that a and b * a * have the same order. 

(iii) Show that a * b and 6 * a have the same order. 

28. Let (G, *) be a group and a, b £ G. 

(i) Suppose that a*6 = & 5 *a 3 . Show that o(6*a -1 ) = o(6 5 *a) = o(6 3 *a 3 ). 

(ii) Generalize (i) to arbitrary powers of a and b. 

29. Let (G, *) be a group, a (E G and o(a) = n. Let 1 < p < n be such that p 
and n are relatively prime. Show that o{a p ) = n. 

30. Let (G, *) be a group, a 6 G, and o(o) = p, where p is a prime. 

(i) Show that o(a fc ) = p for all 1 < k < p. 

(ii) Show that for all m E N, either a m = e or o(a 771 ) = p. 
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31. Let (G y *) be a group and a G G. Suppose that o(a) = n and n = m.k for 
some m, k G Z. What is o(a fc )? 

32. (i) Let (Gy *) be a group, a,b G Gy o (a) = n, o(b) — m, gcd(m, n) — 1, 
and a * b — b * a. Show that o(a *b) = mn. 

(ii) Let ( 6 ?,*) be a group, G G, o(a*) = n*, 1 < i < m. Suppose 
gcd(ni, nj) — 1 and diCij = ajdi for all i and j. Let x = aq * 02 * • • • * flm- 
Show that o(x) — n\Ti 2 ■ ■ • rim. 

33. Let (Gy*) be a group and x G G. Suppose o(x) = n — n\n 2 njt, 
where for all i ^ j, rii and are relatively prime. Show that there exists 
Xi G G such that o(xi) — rii for all i = 1,2,..., k, x = x\ * X 2 * • ■ • * x^ 
and Xi * Xj = Xj * Xi for all i and j. 

34. Let G = {(a, b) | a, b G R, a 7 ^ 0}. Then G is a group under the binary 
operation (a, b) * (c, d) = (ac, bc + d ) for all (a, b ), (c, d) G G. Show that G 
has infinitely many elements of order 2 , but G has no element of order 3. 

35. Let a, 6 G Sym. As remarked in Example 2.1.29, every rigid motion of 
the square can be considered a one-one function of {1, 2, 3,4} onto itself. 
Consider a*b as a function. Show that < 2*6 = aob, where * represents the 
binary operation of rigid motions of the square and o is the composition 
of functions. 

36. Let (Sy *) be a finite semigroup. Prove that there exists a G S such that 

2 

a = a. 

37. Let (Gy *) be a finite semigroup with identity. Prove that (G, *) is a group 
if and only if G has only one element a such that a 2 = a. 

38. Prove that a semigroup (5, *) is a group if and only if a * S = S and 
S * a ~ S for all a G S, where a* S = {a* s j s G S} and S * a = {s * a \ 
s G S}. 

39. Prove that a semigroup (S, *) is a group if and only if 

(i) there exists e G S such that a * e = a for all a G S, and 

(ii) for all a G S there exists b G S such that a* b = e. 

40. Rewrite the statements and proofs of the theorems in this chapter using 
additive notation. 


41. Let (G, *) be a group, a, b G G and m,n G Z. Prove that 

(i) a n * a™ = a n+m = a m * a 71 , 

(ii) (a 71 ) 771 = a nrn y 
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(iii) a~ n — (a n ) _1 , 

(iv) e n = e, 

(v) (a * b) n = a n * b n , if (G, *) is commutative. 


42. Write the proof if the following statements are true; otherwise, give a 
counterexample. 

(i) Let T(iS') be the set of all functions on S = { 1 , 2,3}. T(S) is a group 
under composition of functions. 

a b 


(ii) M 2 (R) 


c d 


a, 6, c, d E R > is a group under usual matrix 


multiplication. 

(iii) Every group of four elements is commutative. 

(iv) A group has only one idempotent element. 

(v) A semigroup with only one idempotent is a group. 

(vi) If a semigroup S satisfies the cancellation laws, then S is a group. 
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Niels Henrik Abel (1802-1829) was 
born on August 5, 1802, in Finnoy, Nor¬ 
way. He was the second of six children. Abel 
and his brothers received their first educa¬ 
tion from their father. 

At the age of 13, Abel along with his 
older brother, was sent to the Cathedral sch¬ 
ool in Christiania (Oslo). In 1817, his math¬ 
ematics teacher was Bernt Michael Holmbe, 
who was seven years older that Abel. Holmbe 
recognized Abel’s talent and started giving 
him special problems and recommended spe¬ 
cial books outside the curriculum. Abel and 
Holmbe read the calculus text of Euler and 
the work of Lagrange and Laplace. Soon 
Abel became familiar with most of the im¬ 
portant mathematical literature. 

Abel’s father died when he was 18 years old and the responsibility of supporting 
the family fell on his shoulders. He gave private lessons and did odd jobs. However, 
he continued to carry out his mathematical research. 

Abel, in his last year of school, attacked the problem of the solvability of the quintic 
equation, a problem that had been unsettled since the sixteenth century. Abel thought 
that he had solved the problem and submitted his work for publication. Unable to 
find an error and understand his arguments, he was asked by the editor to illustrate 
his method. In 1824, during the process of illustration he discovered an error. This 
discovery led Abel to a proof that no such solution exists. He also worked on elliptic 
functions and in essence revolutionized the theory of elliptic functions. 

He traveled to Paris and Berlin in order to find a teaching position. Then poverty 
took its toll, and Abel died from tuberculosis on April 6, 1829. Two days later a 
letter from Crelle reached his address, conveying the news of his appointment to the 
professorship of mathematics at the University of Berlin. 

Abel is honored by such terms as Abelian group and Abelian function. 






Chapter 3 

Permutation Groups 


Permutation groups is one of the specialized theories of groups which arose 
from the source, classical algebra, in the evolution of group theory. 

3.1 Permutation Groups 

As stated earlier, there are four major sources from which abstract group the¬ 
ory evolved. Mathematicians’ interest in finding formulas to solve polynomial 
equations by means of radicals led some mathematicians to the study of permu¬ 
tations of the roots of rational functions. Lagrange, Rufini, and Cauchy were 
among the earlier mathematicians to work with permutation groups. However, 
it was Cauchy whose systematic study of permutation groups (between 1815 
and 1845) is believed, by some, to be the origin of abstract group theory. Many 
of the concepts and major results in this chapter are due to Cauchy. 

We begin our study of permutation groups by defining what a permutation 
is. 

Definition 3.1.1 Let X be a nonempty set. A permutation ir of X is a 
one-one function from, X onto X. 

Definition 3.1.2 A group (G,*) is called a permutation group on a nonem¬ 
pty set X if the elements of G are permutations of X and the operation * is 
the composition of two functions. 

Example 3.1.3 Let X be any nonempty set and Sx be the set of all one-one 
functions from, X onto X, as defined in Example 2.1.9. Then (Sx,°) is a group 
as we have shown in Example 2.1.9, where o is the composition of functions. 
Hence, (Sx,°) is a permutation group. 

In this chapter, and in fact in this text, our study of permutation groups 
will focus on permutation groups on finite sets, i.e., X is a finite set. 
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Before we consider more examples of permutation groups, let us fix some 
notation which will be useful when working with permutations. 

Let I n = {1,2,..., n}, n > 1. Let ir be a permutation on I n . Then 


?r = {(l,7r(l)),(2,7r(2)),...,(n,7r(n))}. 


(Recall that a function f : A —> A is a subset of A x A) It is sometimes 
convenient to describe a permutation by means of the following notational 
device: 


7r — 


12 3 n 

7r(l) 7t(2) 7t(3) • • - 7r(n) 


This notation is due to Cauchy and is called the two-row notation. In the 
upper row, we list all the elements of I n and in the lower row under each 
element i G J n , we write the image of the element, i.e., 7 r(i). 


Example 3.1.4 Let n = 4 and ir be the permutation on I 4 defined by 7 r(l) = 2, 
7t(2) = 4, 7t(3) = 3, and 7 r( 4 ) = 1. Then using the two-row notation we can 
write 


7T = 


1 2 3 4 \ 

2 4 3 1 J 


As we shall see, the two-row notation of permutations is quite convenient 
while doing computations such as determining the composition of permuta¬ 
tions. 

Let n = 7 and 7 r and a be two permutations on I 7 defined by 


/l 234567\ 
^ 1 3 4 6 7 2 5 ) 


and 


( 1 2 3 4 5 6 7 \ 

^ 2 5 3 1 7 6 4 )’ 


Let us compute 7r o g. Now by definition, (77 o <r)( 2 ) = 7r(<7(f)) for all i 6 I 7 . 
Thus, 

(tt o cr)(l) = tt((j( 1)) = 7r(2) = 3, 


( 7 T o cr)(2) = 7 r(cr( 2 )) = 7 r( 5 ) = 7 

and so on. From this, it is clear that when determining, say, ( 7 r o cr)(l), we 
start with cr and finish with 7 r and read as follows: 1 goes to 2 (under a) and 
2 goes to 3 (under 7 r) and so 1 goes to 3 (under 7 r o a). We can exhibit this in 
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the following form: 


1 ^ 2 ^ 3 


2 - 45-47 


1^3 

2^7 


3 ^ 3^4 


S’toct 


4 -4 1 -4 l 


4 7roa 


5 _4 7 5 


6 A 6 A 2 


744-^6 


5™ 5 
6 ^ 2 
7 ™ 6 . 


Thus, 


7T O cr = 


1 2 3 4 5 6 7 

3 7 4 1 5 2 6 


Example 3.1.5 Let n — 6 and a a,nd {3 be permutations on Iq defined by 


1 2 3 4 5 6 


3 1 4 6 5 2 


1 2 3 4 5 6 

1 3 5 4 2 6 


Let us first determine a o (3. Now 1 1 -4 3, i.e., 1 3. Similarly, 2 4, 

3 °^ 5, 4 °^S 6 , 5 1, 6 a -^ 2 . 77ms, 


ao (3 — 


1 2 3 4 5 6 


3 4 5 6 1 2 


Similarly, for (3 o a; 1 -4 3. —> 5, i.e., 1 5 and so on. In this case, we start 


with a and finish with (3. Note that 


(3 o a = 


1 2 3 4 5 6 


5 1 4 6 2 3 


We note that a o (3 (3 o a. 


Let S n denote the set of all permutations on I n , n > 1 . 


Example 3.1.6 In this example, we describe 63 , i.e., the set of all permuta¬ 
tions on I 3 — {1,2,3}. From, Exercise 8 (page 50,), we know that the num,ber 
of one-one functions of I 3 onto I 3 is 3! = 6 . Thus, |JS 3 1 = 6 . Let e denote the 

identity permutation on I 3 , i.e., e = l . Let a\ be a nonidentity 


permutation on I 3 . Let us see som,e of the choices for a\. Suppose au(l) = 1. 
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If oil (2) = 2, then we must have o:i(3) = 3 since a\ is a permutation. In this 
case, we see that ai = e, a contradiction. Thus, we must have ai(2) = 3 a.nd 


ai(3) = 2, i.e., a\ 


1 2 3 
1 3 2 


. In a similar m.anner, we can show that 


the other four permutations on 1% a,re <22 = 


1 2 3 

2 1 3 


a 4 


2 3 1 ’ aWas 


1 2 3 
3 1 2 


. Thus, 


<23 


1 2 3 \ 

3 2 1/’ 


53 = {e,Q'i,Q'2,ia;3,Q!4,Q!5}. 


Le£ us denote by a and a 4 by (3. We ask the reader to check that (3 2 = a$, 
a o (3 = ai, a,nd a o f3 2 — 012 ,■ Hence, we can write 


S 3 = {e,/3,/3 2 ,a,o ; o/3,ao^ 2 }. 


Since ( 53 , 0 ) is also a group, we ask the reader to show that 0 ( 0 ) = 2 and, 
o(/ 3 ) =3 by showing that c? — e and (3 2 7 ^ e, 6 ui /3 3 = e. 


In the previous example, the permutation group (Ss,°) consisted of all 
permutations on the set 1%. Next, we give an example of a permutation group 
that does not contain all permutations on a given set. 


Example 3.1.7 Let n = 4 and consider I 4 = {1,2,3,4}. Recall that in Ex¬ 
ample 2.1.29, we remarked that rigid motions of the square can be viewed as 
permutations on I 4 . Let S be the set of all permutations that corresponds to 
the rigid motions of the square. We will use the sam.e notation for the per¬ 


mutations, i.e., rgo is the permutation 


12 3 4 
2 3 4 1 


, r 36 o is the identity 


permutation, etc. By Exercise 35 (page 80), it follows that the multiplication 
table of(S, o) is the sam,e as the multiplication table of the group ( Sym., *). Now 
composition of functions is associative and from, the multiplication table, it fol¬ 
lows that S is closed under o, r 36 o is the identity of (S', o), and, every element 
of S has an inverse. Thus, (S, o) is a group. Hence, the group of symmetries 
of a square can be thought of as a permutation group on I 4 . 


The following theorem describes some basic properties of S n . 


Theorem 3.1.8 (i) (S n , o) is a group for any positive integer n > 1 . 
(ii) If n > 3, then (S n , °) is noncommutative. 

(in) ]S n | = n! 
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Proof. (i) We have already noted that the set of all one-one functions of 
any nonempty set onto itself forms a group under composition of functions in 
Example 2.1.9. Thus, (S n , o) is a group for any positive integer n > 1. 

(ii) Let n > 3. Let a, (3 € S n be defined by 


( 1 2 3 4 
^ 1 3 2 4 



12 3 4 
3 2 14 



Now 


and 


a o (3 — 


12 3 4 
2 3 14 


n 

n 


P o a — 


12 3 4 
3 12 4 


77, 


n 


Thus, (o o /3)(1) = 2^3 = (P o o)(l). Hence, a o p ^ p o a and so S n is 
noncommutative. 

(iii) This follows from Exercise 8 (page 50). 


Definition 3.1.9 The group (S n ,o) is called the symmetric group on I n . 


Consider the permutation 7r = 


7r(l) tt(2) 


then we drop the column ,. For example, a — 

7T(7) 


77, 

• ■ 7T (n) 

12 3 4 
14 3 2 


. If 7 r(i) = i, 


is denoted 


by 


2 4 
4 2 


Definition 3.1.10 Let 7r be an element of S n . Then it is called a k-cycle, 
written (H 72 • • • ik), if 


( %l 7 2 • • • 7 k-\ ik 
7T = . 

\ l 2 n Ik H 

i.e., 7r(ij) = ij+i, j == 1,2,..., k — 1, 7r(ik) — ii, and it (a) = a for any other 
element of I n . 

Note that if 7r = (i\i2 • • • ifc), then 

7T = (iii2 •■■**) 

= (?2^3 

— (bb+i' * ■ ■ ■ ‘ij— i)* 
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A fc-cycle is called a transposition when k = 2 . 

We know that in Example 3.1.7, the permutation rgo is a 4-cycle and is 
a 2-cycle. We write 

r 90 = (1 2 3 4) 

and 

d 2 = (1 3). 

The identity of S n is sometimes-denoted by ( 1 ) or e. 

Example 3.1.11 Using the cycle notation, we can write 

5 3 = {e,(12),(l 3), (2 3), (1 2 3), (1 3 2)}. 

We now note some of the properties of the group (S 3 , o). 

(i) (£ 3 , 0 ) is a. noncom.m.utative group of order 6 by Theorem 3.1.8. 

(it) S 3 contains two elements of order 3; for (1 2 3) o (1 2 3) = (1 3 2) / e 
and (1 2 3) o (1 2 3) o (1 2 3) = e. Hence, the order of (1 2 3) is 3. Similarly, 
the order of ( 13 2) is 3. The order of (l 2), (1 3), and (2 3) is 2 since (1 2) o (1 

2) - e, (1 3) o (1 3) = e, and (2 3) o (2 3) = e. ' 

(Hi) In S 3 , the product of distinct elements of order 2 is an elem,ent of order 
3. (1 2 ) o (2 3) = (1 2 3), (1 3) o (1 2 ) = (1 2 3), (1 2 ) o (1 3) = (1 3 2 ), (2 

3) o (1 2) = (1 3 2), (1 3) o (2 3) = (1 3 2), and (2 3) o (1 3) = (1 2 3). 

Definition 3.1.12 Let a, (3 E S n . Then a and (3 are called conjugate if there 
exists 7 E S n such that 

7 o a o 7 -1 = (3. 

The following theorem shows how to compute the conjugate of a cycle. 
Theorem 3.1.13 Let n = (i\i 2 • • -i{) E S n be a cycle. Then for all a E S n , 

a on o a -1 = (a(ii) 0 ( 72 ) ■ * • a(ii)). 

Proof. Since a E S n , a is a one-one mapping of I n onto I n . Thus, the ele¬ 
ments o(l),..., oc(n) E I n are all distinct and so I n = {a(l), a(2),..., a(n)}. 
Let r be any integer such that 1 < r < l. Then 

(ao7roa; _ 1 )(a(i r )) = a(7r(a - 1 (o:(i r )))) 

= a(7r(t r )) 

= a(i r _|_i). 

Also, (ao 7 roa _ 1 )(Q;(i/)) = a(' 7 r(a~ 1 (a(z;)))) = a( 7 r (i[)) = a(i\). Now let a E I n 
be such that a 7 ^ a(i r ) for all r, 1 < r < l. Then o - 1 (a) E I n and a -1 (a) 7 ^ i T 
for all r, 1 < r < l , and so 7 r(a - 1 (a)) = a -1 (a). Thus, 

(ao7roa _ 1 )(a) = a(7r(a _ 1 (a))) 

= a(a _ 1 (a)) 

= a. 

“Pi 
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It now follows that aos o cx 1 = (or(ii) 0 ( 22 ) • ■ • a(i*)). ■ 

Definition 3.1.14 Let 7ri,7T2,. .. , 7ffc G S n . Then 772 ,. .., Sk ore called dis¬ 
joint if for all i, 1 < i < k and for all a G I n , 7 q(a) 7 ^ a implies 7 rj(a) = a /or 
all j ^ i, 1 < j < k. 

In other words, 7 Ti, 7T2, ... ,77*. G £ n are disjoint if for all 1 < i < fc and 

for all a G 7 n , if 77^ moves a, then all other permutations 7 Vj must fix a, i.e., 

TTj (a) = a for all j 7 ^ i, 1 < j < k. > >-. 

Let 7r and A be disjoint permutations on I n . Let a G S be such that 77 (a) 7 ^ a. 
Then A(a) — a. Let 7 r(a) — b. Then ( 7 r o A)(a) = 7r(A(a)) = 7r(a) = b. Also, 

(A o 7r)(o) = A(7r(o)) — A(6). If 7 r(6) = 6, then 7 r(b) = 6 = 7r(a) and so a = b. 
Thus, 7r(a) — b = a, a contradiction. Hence, 77 ( 6 ) 7 ^ 6 and so A(6) = b. Thus, 

(A o 7 r)(a) = A( 7 r(a)) = A( 6 ) = 6 . Hence, ( 7 r o A)(a) = (Ao 7 r)(a). Suppose 
7 r(a) = a. If A(a) = a, then (77 o A)(a) = 0 = (A o it )(a). Suppose A(a) 7 = a. By 
a similar argument as before, (77 o A) (a) — (X o 7 r)(a). Therefore, 7 r o A = A o 77. 
Consequently, if 77 and A are disjoint permutations, then they commute. 

Consider 7 r = ^ * 5 \ g 3 7 g € 5 n - Then 7 r = (1 2 5 3) o (4 

8) o (6 7) can be written as a product of disjoint cycles. This leads us to the 

following theorem. 

Theorem 3.1.15 Any nonidentity permutations of S n (n > 2) can be uniquely 
expressed (up to the order of the factors) as a product of disjoint cycles, where 
each cycle is of length at least 2. 


Proof. We prove the result by induction on n. Suppose n — 2. Now (£2 1 = 2 

f 1 2 \ 

and the nonidentity element of £2 is a = ( ^ ^ j ■ Now a — (1 2), i.e., a is a 

cycle. Thus, the theorem is true for n = 2. Suppose n > 2 and the theorem is 
true for all £* such that 2 < k < n. Let 7 r be a nonidentity element of £ n . Now 
7 r*(l) G I n for all integers i, i > 1. Therefore, { 7 r(l), 7 t 2 ( 1 ), ..., 7 T 1 (1), ...} C I n . 
Since I n is a finite set, we must have s l (l) = s m {l) for some integers l and m 
such that l > m > 1. This implies that — 1 . Let us write j = l — m. 

Then j > 0 and 77 ^ ( 1 ) — 1 . Let i be the smallest positive integer such that 
7 r*(l) = 1 . Let 

A = {l,7r(l),7r 2 (l),... ,7r* -1 (l)}. 


Then all elements of the set A are distinct, 
defined by 


r = (1 7r(l) 7t 2 (1) • • • 


Let r G £ n be the permutation 


i.e., r is a cycle. Let B — I n \A. If B — 0, then 7 r is a cycle. Suppose B 7 ^ <f>. 

Let a = s\b- If o is the identity, then 7 r is a cycle. Suppose that cr is not the 
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identity. Now by the induction hypothesis, a is a product of disjoint cycles on 
B , say, <7 = o\ o cr 2 o • • • o a T . Now for 1 < i < r, define 7r £ by 


7Ti(a) 


cr z (a) if a 6 5 
a if a ^ B. 


Then 7 Ti, 7T2, ..., 7r r and r are disjoint cycles in S n . It is easy to see that tx — 
7Ti o 7T2 o • • ■ o 7r r o r. Thus, 7T is a product of disjoint cycles. 

To prove the uniqueness, let 7r = tx\ o 7r2 o • • • o tt t = pi o p 2 ° • • ■ ° /i s , a 
product of r disjoint cycles and also a product of s disjoint cycles, respectively. 
We show that every 7r* is equal to some p,j and every pk is equal to some 7r £ 
Consider 7ly, 1 < 2 < r. Suppose 7r* = ( 21^2 b)- Then 77 ( 21 ) This implies 
that i\ is moved by some pp By the disjointness of the cycles, there exists 
unique pj, 1 < j < s, such that H appears as an element in pj. By reordering, 
if necessary, we may write p 3 = (ii c 2 ... c m ). Now 

i 2 = 7T*(ii) = 7r(2i) = = c 2 

h = TXiin) = 77 ( 22 ) = tt(c 2 ) = jUy(c 2 ) = c 3 


il = 7T^(b_i) = 7T(b_l) = 7r(Q_x) = /ij(cj_i) = Q. 

If l < m, then b = 77 £ (b) = 77(b) = 7 r(c/) = Pj(ci) = c £+ i, a contradiction. 
Thus, £ = m. Hence, 7r £ = pj for some j, 1 < j < s. Similarly, every p^ ~ 7 r £ 
for some t, 1 < t < r. ■ 


Corollary 3.1.16 Let n > 2. Any permutation 1 r of S n ca,n be expressed as a 
product of transpositions. 


Proof. In view of the preceding theorem, it suffices to show that every Re¬ 
cycle can be expressed as a product of transpositions. This fact is immediate 
from the following equations: 

e = (!) = (! 2) o(l 2) 


and for k > 2 

Oh 72 • • • ik) = (h ik) 0 (ii ik- 1 ) o • • • o (H i 2 ), 
where {21,22 • ■ ., ik} T bn- ■ 

Let tx E S n . Since Sh is a finite group, we know that 0 ( 77 ) is finite. Thus, in 
order to find the order of 7T, we need to compute 7r, 7 t 2 , 7t 3 , ..., until we find the 
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first positive integer k such that Tv k — e. Finding such a positive integer could 
be a tedious task. However, we can effectively make use of the decomposition 
of 7 r as a product of disjoint cycles, compute the order of each cycle, which is 
nothing but the length of the cycle (Exercise 17, page 97) and from the order 
of the cycles deduce the order of tv. We ask the reader to consider this problem 
in Exercise 18 (page 97). 

Theorem 3.1.15 tells us that any permutation a € S n , n > 2, can be written 
as a product of disjoint cycles. However, the theorem does not tell us how to 
find the disjoint cycles in the decomposition of a. Next, we illustrate how to 
find these cycles. 

Let tv be a permutation on I n , n > 2 . In order to express tv as a product 
of disjoint cycles, first consider 1, 7r(l), 7r 2 (l), 7r 3 (1), ... and find the smallest 
positive integer r such that zr r (l) = 1. Let 

<71 = (1 Tf(l) 7T 2 (1) 7T r_ 1 (l)). 

Then a\ is a cycle of length r. Let i be the first element of I n not appearing in 
(7 1 . Now consider i, 7 r(z), tv 2 ({), zr 3 (z), ... and find the smallest positive integer 
s such that zr 5-1 ^) = i. Let 

<72 = (i TV (i) TV 2 {i) • ■ • 7T S- 1 (z)). 


Then a 2 is a cycle of length s. Now 

{1,7t(1),7T 2 (1), . . . ,7r r_1 (l)} n {z,7t(z),7T 2 (z), . . . ,7T S ' 1 (z)} = 0 , 

for if j e { 1 , tt( 1 ), tt 2 ( 1 ), ..., tv 1 1 (1)} n {i, t v(i), tt 2 (z), ..., tt 5-1 ^)}, then 
j = tv p (z) for some p, 1 < p < r, and j — 7r fc (l) for some k, 1 < k < s. 
Thus, {1, 7 r(l), 7 t 2 ( 1 ), ..., 7 r r- 1 (l)} = {z, t r(z), Tv 2 (i), ..., 7 r s—1 (i)}, which is 
a contradiction. Hence, <j\ and a 2 are disjoint cycles. If { 1 , 7 r(l), tt 2 (1), ..., 
7 r r_1 (1)} U {z, 7 r(z), 7 r 2 (z), ... , 7 r s_ 1 (z)} ^ I n , then consider the first element of 
I n not appearing in { 1 , tt( 1 ), 7 r 2 (l), ..., ^"^l)}U {z, ?r(z), tt 2 (z), ..., 7 r s_ 1 (z)} 
and continue the above process to construct the cycle ( 73 . Since I n is finite, the 
above process must stop with some cycle cr m . Then zr = <Ji o <72 o ■ • • o cr m . 

We illustrate the above procedure with the help of the following example. 


Example 3.1.IT Consider the permutation 


_/l 234567^ 

" V 6 3 5 2 4 7 lj 

on lj. Here 7 r(l) = 6 , 7 r 2 (l) = zr( 6 ) = 7, and 7 t 3 ( 1 ) = 7 f( 7 ) = 1. That is, 
1 6 7 —*■ 1. Hence, ny = (1 6 7) is a 3-cycle. Now 2 is the first, element of 
I 7 not appea,ri,ng in (16 7). Also, tv(2) = 3, zr 2 ( 2 ) = zr(3) = 5, zr 3 (2) = zr(5) = 4, 
and 7 t 4 (2) = 7 t(4 ) — 2. That is, 2 —> 3 —* 5 ^ 4 2. Hence, g-i = (2 3 5 4) is 

a cycle of length 4. Now < 7 \ and 02 are disjoint and tv = o~i o (7 2 . 
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While writing a permutation as a product of disjoint cycles, it is customary 
not to write cycles of length one in the product. Thus, if some element of 
I n does not appear in any of the cycles, then it is assumed to be fixed. For 
example, if tt = (1 2 5) o (4 6 ) € S 7 , then since 3 and 7 neither appear in (1 2 
5) nor in (4 6 ), they are fixed, i.e., 7 r( 3 ) = 3 and 7 r( 7 ) = 7. 

Given a permutation it 6 S n , n > 2 , we can write it as a product of disjoint 
cycles. We can also write tt as a product of transpositions. However, the 
representation of tt as a product of transposition need not be unique. For 
example, (12 3) = (1 3) o (1 2) = (2 1 ) o (2 3). Also, (1 3) = (1 2 ) o (1 3) o (2 
3). That is, (1 3) can be written as a product of one transposition or as a 
product of three transpositions. However, we will show that the number of 
transpositions in any representation of a permutation is either even or odd, 
but not both. We now proceed to prove this result. 

Consider the formal product 

A ni<i<j<n( a i — a j) = ( a l “ a 2)( a l — a 3 ) ' ‘ ‘ ( a l — a n) 

(02 — 03 ) • • • (02 - On) 

(Un —1 On)- 

If n = 4, then X — (ax — a, 2 )(ai — a 3 )(ai — 04 )(a 2 — a.3)(o2 — 04 X 03 — 04 ). 

For any permutation 7 r € 5 n , let 

77 (A’) = ( a iv(i) ~ a TV(j))‘ 

l<i<j<n 

Let us first examine <j(X) for any transposition a G S n . 

Lemma 3.1.18 Let n > 2. Let a = (i j) € S n , i < j, be a transposition. Then 
a(X) = -X. * 

Proof. First consider the factor (a* — aj ) in the product X. The correspond¬ 
ing factor in cr{X) is CL<j(i) — a a(j)- Now 

^cr(j) Oj O'! (o-j Oj). 

Next, consider the factor — o;, where both k and l are neither equal to i nor 
equal to j. The corresponding factor in cr(X) is a a ^ — a a ^ and 

Q'cr(k) ^cr(Z) ' Ofc O l■ 

Thus, the factor a & — a/ remains unaltered. Now consider the factor — a/, 
where either k or l (but not both) is equal to i or j. Let 1 < t < n. Suppose 
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t < i < j. We have the pair of factors (a t — af) and (at — dj) in the product X. 
The corresponding factors in <j(X) are a a ^ — a a ^ and — a a ^ and 

(^■( 7 (£) ®o , (i))( < ^a'(t) — ( a t — aj)(a t &i) = &?)(&£ a j)- 

Therefore, the product ( a t — a^)(at — aj) remains unchanged. Now suppose 
i < t < j. Then we have the pair of factors (a* — at) and (aj — aj) in the 
product X. The corresponding factors in cr(X) are a a ^ —a a ^ and u a (£) — a a(j) 
and 

(^cr(i) ®er(£)) (^cr(f) ^er (j)) — (®j dt){dt df) = (<2 i dt)(dt dj). 

Hence, the product (a* — at) (at — dj) remains unaltered. Finally, let i < j <t. 
Then we have the pair of factors (a* — at) and (dj — at) in the product X. The 
corresponding factors in cr(X) are a a ^ — a CT ( t ) and a a(j) ~ a cr(t) and 

( a i?(i) ~ a cr(t))i a a(j ) a cr(t )) ( a j a t)(d{ dt) = (ai dt)(aj df) . 

Therefore, the product (a* — a t )(dj — a t ) remains unaltered. Thus, all factors 
other than a* — dj and a/. — a/, where both k and l are neither equal to i 
nor equal to j, can be paired so that the product of factors under a remains 
unaltered. Hence, it now follows that cr(X) = —X. ■ 

Theorem 3.1.19 Let n > 2. Let 7r E S n . Suppose 

7T = (JlO(T20‘‘’0<J r =TlOT2 0- -- OT s , 

where cq, Tj E S n are transpositions, i = 1,2, ..., r, and j = 1,2, ..., s. Then 
both r and s are either even or odd. 

Proof. By Lemma 3.1.18, o~i(X) = —X and Tj(X) — — X for all i = 1,2, 

..., r, and j = 1,2, ..., s. First we compute (&i o <7 2 o • • • o a r )(X). Now 

(<7l o cr 2 o ■ ■ ■ o (T r )(X) = cr 1 (<j 2 (- • • (a r (X)))) 

= \-l) r X. 

Similarly, (t\ o r 2 o ■ • • o r s )(X) = (—1 ) S X. Hence, (—l) r — (—l) s . Thus, both 
r and s are either even of odd. ■ 

By the above theorem, if 7r E S n , then 7T can be written as a product of 
either an even or an odd number of transpositions, but not both. This leads 
us to the following definition. 

Definition 3.1.20 Let 7r E S n . If tv is a product of an even number of trans¬ 
positions, then 7T is called an even permutation; otherwise -k is called an odd 

permutation. 

Corollary 3.1.21 Let i r E 5 n be a k-cycle. Then tv is an even permutation if 
and only if k is odd. 

~Pu*uc- 7feaiAe»taftea/ 
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Proof. Let 7r = (1 2 • • • k). Then 7r — (1 k) o (1 k — l)o • • • o(l 2), i.e., 7r 
is a product of k — 1 transposition. If 7r is an even permutation then A: — 1 is 
even and so k is odd. On the other hand, if k is odd, then k — 1 is even and so 
7 r is an even permutation. This completes the proof. ■ 

Let A n denote the subset of S n consisting of all even permutations, n > 2. 

Theorem 3.1.22 For n > 2, the pair (A n ,o) is a group, called the alternat¬ 
ing group on I n . 

Proof. Since e = (1 2 ) o (1 2 ), e E A n . Thus, A n 7 ^ 0. A product 7 Ti 0772 
is even if and only if tt\ and 7T2 are both even or both odd by Theorem 3.1.19. 
Therefore, A n is closed under o. If 7r 6 A n , then 7r o 7 r -1 = e is even and hence 
7r _1 € A n . Hence, (A n , o) is a group. ■ 

Cauchy recognized many important properties of A n . Among others, he 
proved the following theorem. 

Theorem 3.1.23 Every element in A n is a product of 3-cycles, n > 3. 

Proof. Let 7r e A n . Then 7r = o\ o 02 0 - • • ° & r , where <7* is a transposition, 
1 < z < r, and r is even. Now for any transposition (a 6), 

(a b ) = (1 a) o (1 b) o (1 a). 


Thus, 


7T = (1 ii) o (1 i 2 ) o ■ ■ • o (1 i m ) 


where m is even. Since (1 ii) o (1 22 ) = (1 12 zi), it follows that 7r is a product 
of 3-cycles. ■ 


3.1.1 Worked-Out Exercises 

0 Exercise 1 Prove that two cycles in S n are conjugate if and only if they 
have the same length. 

Solution: Let a = {i\i 2 • • -z r ) and (5 = {jij 2 • • -j s ) t> e two cycles in S n . 
First suppose that a and /3 are conjugate. Then j3 = cr~ 1 o a o o for some 
a 6 S n . Since a is onto and i; 6 I n , there exists ki such that a{k{) = ii for all 
l = 1,2, ... ,r . Now 

(jij 2 ■■ -js) = (o-'^zi)^* 1 ^) • • -cr' 1 ^)) ( by Theorem 3.1.13) 

= (kik2 ■ ■ ■ k r ). 

Hence, s — r and so a and (3 are of the same length. 

~PuAuc- TfeoiAewtaftca/ 
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Conversely, let a — {i\i 2 • • -i r ) and j3 = {j\j 2 ■ ■ • j r ) be two cycles in S n 

( 'l'\ XO * * ' % \ 

T , i.e., cr(ii) — ji for all l = 

3\ 32 “■ Jr ) 

1 ,2,..., r, and a (a) = a for all a £ I n \{i 1 , 12 , ■ ■ ■ , v}- Then cr € S n . Now 
a~ 1 o(3oa = (cr~ l {ji)cr~\j 2 )--a~ 1 (jr)) = (w-'-ir) = <*• 

<0 Exercise 2 Express the permutation 


^1 2345678^ 
^2385647lJ 


on Is as a product of disjoint cycles and then as a product of transposition. 
Is a an even permutation? 


Solution: We have cr(l) = 2, cr 2 (l) — cr(2) = 3, cr 3 (l) = cr(3) = 8 , and 
<7 4 (1) = cr ( 8 ) = 1. Thus, (1 2 3 8 ) is a cycle. Now 4 is the first element of 
Is not appearing in (1 2 3 8 ). We have a( 4) = 5, cr 2 (4) = a( 5) = 6 , and 
ct 3 (4) = <j( 6 ) = 4. Hence, (4 5 6 ) is also a cycle in cr. Next, 7 is the first element 
of Is not appearing in (1 2 3 8 ) and (4 5 6 ). Now cr( 7) = 7. Since all the 
elements of Is appear in one of the cycles (1 2 3 8 ), (4 5 6 ), and (7), we have 
cr = (1 2 3 8 ) o (4 5 6 ). Now (1 2 3 8 ) = (1 8 ) o (1 3) o (1 2) and (4 5 6 ) = (4 
6 ) o (4 5). Thus, cr — (1 8 ) o (1 3 ) o (1 2) o (4 6 ) o (4 5). Since a is a product of 
five transpositions, cr is not an even permutation. 


0 Exercise 3 Write all elements of S 4 . Show that £4 has no elements of order 
> 5. 


Solution: Let cr £ 54 and cr = aj ocr 2 ° • • • ocr^, a product of disjoint cycles. 
Since S 4 is a permutation group on I 4 , k < 2 . If k = 1 , then cr is a 2 -cycle, 
3-cycle, or 4-cycle. If k — 2, then cr is a product of two disjoint transpositions. 
The number of distinct cycles of length 2 is 6 , the number of distinct cycles of 
length 3 is 8 , and the number of distinct cycles of length 4 is 6 . Hence, S 4 = {e, 
(1 2), (1 3), (1 4), (2 3), (2 4), (3 4), (1 2 3), (1 3 2), (2 3 4), (2 4 3), (1 3 4), 
(1 4 3), (1 2 4), (1 4 2), (1 2 3 4), (1 3 2 4), (1 4 2 3), (1 2 4 3), (1 3 4 2), (1 4 
3 2), (1 2) o (3 4), (1 4) o (3 2), (1 3) o (2 4)}. 

Since each 2 -cycle is of order 2 , each 3-cycle is of order 3, each 4-cycle is of 
order 4, and the order of the product of two disjoint 2-cycles is 2, S 4 has no 
element of order > 5. 


<0 Exercise 4 Find the order of (1 2 3 4) o (5 6 7) in S 7 . 


Solution: o(l 2 3 4) = 4, o(5 6 7) - 3. Now (1 2 3 4) and (5 6 7) are 
disjoint. Hence, (1 2 3 4) 0 (5 6 7) = (5 6 7) o (1 2 3 4). If a and 6 are two 
elements of a group G such that 0(0) = m, 0(6) = n, and gcd(m,n) = 1 , then 
o(ab) = m.n. Using this result, we find that the order of (1 2 3 4) o (5 6 7) is 12 . 
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0 Exercise 5 Find the order of (1 2 3 4) o (5 6 ) in Sq. 

Solution: o(l 2 3 4) — 4, o(5 6 ) — 2. Now (1 2 3 4) and (5 6 ) are disjoint 
and so they commute. Thus, ((1 2 3 4) o (5 6)) 4 — e. Now ((1 2 3 4) o (5 
6)) 1 + e, ((1 2 3 4) o (5 6)) 2 = (1 2 3 4 ) 2 o (5 6) 2 = (1 2 3 4 ) 2 ^ e. If ((1 2 3 
4) o (5 6)) 3 — e, then the order of (1 2 3 4) o (5 6 ) will be 3 and 3 divides 4, a 
contradiction. Hence, the order of (1 2 3 4) o (5 6 ) is 4. 

3.1.2 Exercises 

1 . Express the following permutations as (i) a product of disjoint cycles and 
(ii) a product of transpositions: 

( 1 2 3 4 5 6 Wl 2 3 4 5 e\ 

1 3 5 4 1 6 2 y’\ 3 2 1 5 4 6 j' 

2 . Let a — (1 2 5 7) and (3 = (2 4 6 ) € S 7 . Find a o (3 o a -1 . 

3. Let a = (1 3 5 7) and (3 = (2 4 8 ) o (1 3 6 ) G S 8 . Find ao(3oa~ l . 

4. Let a — (1 3) o (5 8 ) and /3 = (2 3 6 7) € Ss • Find a o (3 o a~ 1 . 

5. Let a = (2 5 9) o (1 3 6 ) and (3 = (1 5 7) o (2 4 6 9) G Sg. Find qo/ 3 oq -1 . 

6 . Let (1 3 5 7) and (2 3 6 8 ) G. Ss. Find a € Sg such that a o (1 3 5 
7) o a -1 = (2 3 6 8 ). 

7. If a — (1 2 3 4 5 6 ), show that a = (1 6 ) o (1 5) o (1 4) o (1 3) o (1 2). 

8 . Find the order of (1 2 3) o (4 5) in Ss. 

9. Prove that (1 2 • ■ • n — 1 n ) -1 = (n n — 1 • • • 2 1). 

10. Prove that every transposition is its own inverse. 

11. Prove that the symmetric group on two symbols ( 52 ,°) is commutative. 

12. Let a = (ai <22 * • • a*,) € S n be a fc-cycle. Show that 

a 2 _ I { a i °3 • • • 02 m-i) 0 (&2 0,4 ag • • • a 2 m ) if A; = 2m, i.e., is even 
| (01 03 • ■ • G 2 m+i &2 • • • fl 2 m) if k — 2 m + 1 , i.e., /c is odd. 

13. Determine A 4 . 

14. Let a,/3 £ S n . Show that a -1 o / 5 _1 o a o (3 e A n . 

15. Prove that |j4 n | = y. 

Ptt^LC “PlLySAjcS. 



3.1. PERMUTATION GROUPS 


97 


16. Show that the number of distinct cycles of length r in S n is 

17. Let n > 2 and a G S n be a cycle. Show that a is a k -cycle if and only if 
o(ct) — k. 

18. Let a e S n and cr = a\oa 2 o- • -ocr fc be a product of disjoint cycles. Suppose 
o(a l ) = rii, i = 1,2,..., n. Show that o(cr) = lcm(n!, n. 2 ,..., n *.). 

19. Let a G S n and p be a prime. 

(i) Show that o(a) — p if and only if either a is a p-cycle or a is a product 
of disjoint cycles, where each cycle is either of length 1 or length p and 
at least one cycle is of length p. 

(ii) If a is a p-cycle, prove that either aT 1 = e or a m is a p-cycle for all 
m G N. 

20. Let a and (3 G S n . Let a = a i o c *2 ° • o and (3 = (3\ o (3 2 o ■■■ o (3 S 

be a product of disjoint cycles. Let length(aj) = di and length(/lj) = m,j 
for all i = 1,2,..., k and j = 1,2,..., s and d\ < d 2 < • • • < and 
mi < m 2 < • •• < m s . We say that a and (3 . have the same cyclic 
structure if k = s and d z ~ rrii for all i = 1, 2,..., k. Prove that a and 
(3 have the same cyclic structure if and only if a and [3 are conjugate. 

21. Prove that for tt G S ni tt is an even permutation if and only if = X. 

22. (i) Let a — (k l), (3 G S n be two distinct transpositions, n > 3. Show that 
there exist transpositions p, v G S n such that f3 o cx — v o p, p(k) = k and 
v moves k. 

(ii) Prove that if the identity permutation e G S n can be written as a 
product of r (> 3) transpositions, then e can be written as a product of 
r — 2 transpositions. 

(iii) Prove that if e = ay o<j 2 o • • • ooy G S n as a product of transpositions, 
then r is even. 

(iv) Use (i), (ii), and (iii) to prove that if it G S n , then 7r can be written 
as a product of either an even or an odd number of transpositions, but 
not both. 
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August in-Louis Cauchy (1789- 
1857) was born on August 21, 1789, in 
Paris, France. He received his first educa¬ 
tion from his father. He was a neighbor of 
Laplace and Berthollet. Cauchy became ac¬ 
quainted with famous scientists at a young 
age. Lagrange is said to have warned his fa¬ 
ther not to show Cauchy any mathematics 
book before the age of seventeen. 

At the age of fifteen, he completed his 
classic studies with distinction. He became 
an engineer in 1810, in the Napoleon army. 
In 1813, he returned to Paris. 

In 1811, Cauchy started his mathemati¬ 
cal career by solving a problem sent to him 
by Lagrange on convex polygons. In 1812, he solved Fermat’s famous classical problem 
on polygon numbers. His treatise on the definite integral, which he submitted in 1814 
to the French Academy, later became a basis of the theory of complex functions. 

In 1816, he was appointed full professor at the Ecole Polytechnique. More theorems 
and concepts have been named for Cauchy than for any other mathematician. There 
are sixteen concepts and theorems named for Cauchy in elasticity alone. 

He worked on mathematics, mathematical physics, and celestial mechanics. In 
mathematics, he worked on several areas, such as calculus, complex functions, algebra, 
differential equations, geometry, and analysis. The notion of continuity used today was 
invented by Cauchy. He also proved that a continuous function has a zero between 
two points where the function changes its signs, a result also proved by Bolzano. The 
first adequate definitions of indefinite integral and definite improper integral are due 
to Cauchy 

In algebra, the notion of the order of an element, a subgroup, and conjugates are 
found in his papers. He proved the famous Cauchy’s theorem for finite groups, that is, 
if the order of a finite group is divisible by a prime p, then the group has a subgroup 
of order p. Cauchy’s role in shaping the theory of permutation groups is central. He is 
regarded by some to be the founder of finite group theory. The two-row notation for 
permutations was introduced by Cauchy. He also defined the product of permutations, 
inverse permutations, transpositions, and the cyclic notation. He wrote his first paper 
on this subject in 1815, but did not return to it for nearly thirty years. In 1844, he 
proved that every permutation is a product of disjoint cycles. 

He also did work of fundamental importance in the theory of determinants. His 
treatise on determinants, published in 1812, contains important results concerning 
product theorems and the inverse of a matrix. 

Cauchy enjoyed teaching. He published more than 800 papers and eight books. 
He died on May 22, 1857. 
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Chapter 4 

Subgroups and Normal 
Subgroups 


In Chapter 2, we began a discussion of the evolution of group theory. This 
chapter seems a good place to renew the discussion. It took more than 100 
years for the abstract concept of a group to evolve. The evolution followed 
lines similar to the evolution of other theories. First came the discovery of 
isolated phenomena, followed by the recognition of features common to all. 
Then came the search and classification of other instances. Next, general prin¬ 
ciples emerged. Last, the abstract postulates which define the system were 
uncovered. A deeper account can be found in Bell. 

4.1 Subgroups 

In the previous chapter, we saw that for the groups ( A n , o) and (S n , o), A n is 
a subset of S n . One can think of many examples, where the underlying set of 
one group is a subset of the underlying set of another group. This leads us to 
the concept of a subgroup. 

Let ( G , *) be a group and if be a nonempty subset of G. Then if is said 
to be closed under the binary operation * if a * b £ if for all a, b E if. 

Suppose if is closed under the binary operation *. Then the restriction of 
* to if x if is a mapping from if x if into if. Thus, the binary operation * 
defined on G induces a binary operation on if. We denote this induced binary 
operation on if by * also. Thus, (if, *) is a mathematical system. It also follows 
that * is associative as a binary operation on if, i.e., a * (6 * c) = (a * b) * c 
for all a, 6, c 6 if. If (if, *) is a group, then we call H a subgroup of G. More 
formally, we have the following definition. 

Definition 4.1.1 Let (G,*) be a group and if be a nonempty subset of G. 
Then (if, *) is called a subgroup of (G, *) if (if, *) is a group. 
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Let (H,*) be a subgroup of a group (G, *). Let eu denote the identity of 
H and e denote the identity of G. Now e# * e# = e# = ejj * e. Hence, by the 
cancellation property, e// = e. Thus, the identity elements of G and H are the 
same. Now let h G H. Let h' denote the inverse of h in H and hr 1 denote the 
inverse of h in G. Then h! = h' * e — h' * {h* h ~ l ) = ( h! * h) * h~ l = e * h~ l — 
Thus, the inverse of h in H and the inverse of h in G are the same. 

Of course, if (G, *) is a group, then ({e},*) and (G, *) are subgroups of 
(G, *). These subgroups are called trivial. 

Example 4.1.2 Consider the following list of groups. 

(z) ({0},+), (Z, +), (Q,+), (R,+), (C,+), 
(zz)({l},.),(Q\{0},-),(R\{0},-),(C\{0},-), 

where + is the usual addition operation and ■ is the usual multiplication oper¬ 
ation. Each group is a subgroup of the group listed to its right. For example, 
(Z, +) is a subgroup of ( Q, +), (R, +), and (C, +), and (R\{0}, •) is a subgroup 

°f (c\{o}, ■). 

In the remainder of the text, we shall generally use the notation G instead 
of (G, *) for a group and we write ab for a*b. We shall refer to ab as the product 
of a and b. This notation is usually called multiplicative notation. 

Readers with some knowledge of linear algebra should notice the similarity 
with respect to the type of results and order of presentation of those which 
immediately follow. First comes a result which gives an easy method of deter¬ 
mining if a nonempty subset is a substructure. This is followed by the result 
that the intersection of any collection of substructures is a substructure. Next, 
comes the definition of a substructure “generated” by a subset. Finally, a the¬ 
orem describing the substructure generated by a given subset. These ideas 
appear throughout algebra. We will encounter them again, for example, when 
we examine ideals of a ring. 

Theorem 4.1.3 Let G be a group and H be a nonempty subset of G. Then H 
is a subgroup of G if and only if for all a,b € H, ab ^ 1 G H. 

Proof. Suppose H is a subgroup of G. Let a, b G H. Since H is a sub¬ 
group, it is a group and so b ~ 1 € H. Thus, ab~ l € H since H is closed under 
the binary operation. Conversely, suppose H is a nonempty subset of G such 
that a, b € H implies ab~ l G H. Since H ^ <fi, there exists a G H. Therefore, 
e = aa~ l G H, i.e., H contains the identity. Now for all b G H, 6 -1 = eb G H , 
i.e., every element of H has an inverse in H. Thus, for all a, b G H, a, 6 -1 G H 
and so ab = a(b~ l )~ l G H , i.e., H is closed under the binary operation. From 
the statements preceding Definition 4.1.1, associativity holds for H. Hence, H 
is a group and so H is subgroup of G. ■ 
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In order to see whether a certain nonempty subset of a given group is a 
subgroup or not, we can use Theorem 4.1.3. 

Corollary 4.1.4 Let G be a group and H be a finite nonempty subset of G. 
Then H is a subgroup of G if and only if for all a,b E H, ab E H. 


Proof. If H is a subgroup, then for all a,b E H, ab E H. Conversely, suppose 
that for all a,b E H, ab E H. Let h E H. Then h, h 2 ,..., h n ,... £ H and so 
{h, h 2 ,..., h n , .. .} C H. Since H is finite, all elements of {h, h 2 , ..., h n ,...} 
cannot be distinct. Thus, there exist integers r and s such that 0 < r < s and 
h r = h s . Hence, e — h s ~ T E H. Now s — r > 1. Thus, e — hh s ~ r ~ l implies that 
h~ l — /i s ~ r_1 € H. Let a, 6 € H. Then a, b~ l E H and so ab~ l E H by the 
hypothesis. Thus, by Theorem 4.1.3, H is a subgroup. ■ 

Theorem 4.1.5 Let G be a group and Z(G ) = {b E G \ ab = ba for all a E G}. 
Then Z(G) is a commutative subgroup of G. Z{G ) is called the center of G. 


Proof. Since ae — a — ea for all a E G, e E Z{G ) and so Z(G) ^ f. Let 
a,b E Z(G). Then be = cb for all c E G. From this, it follows that cb~ l — b~ l c 
for all c E G and so 6 _1 E Z{G). Now ( ab~ 1 )c = a(b~ 1 c ) = a(c6 -1 ) = (ac)6 _1 = 
(■ ca)b ~ 1 = c(ab~ l ) for all c E G and so a& -1 E Z(G). Hence by Theorem 4.1.3, 
Z{G ) is a subgroup of G. That Z(G) is commutative follows by the definition 
of Z(G). ■ 

In the remainder of this section, we will see how new subgroups arise from 
existing subgroups of a group. 

Theorem 4.1.6 Let G be a group and {H a \ a E 1} be any nonempty collection 
of subgroups ofG. Then n a ^iH a is a subgroup ofG. 


Proof. Since each H a is a subgroup, e € H a for all a E I. Hence, e E 
n aeI H a and so D a ^jH a ^ 0. Let a, b E D a ^jH a . Then a,b E H a for all 
a E I. Thus, ab -1 E H a for all a E I since each H a is a subgroup and so 
a6 -1 € n a€ /iZa. Consequently, n Q( =/ff Q is a subgroup of G by Theorem 4.1.3. 


Definition 4.1.7 Let G be a group and'S be a, subset ofG. Let 
S — {H | H is a subgroup of G and. S C H}. 


Define 


= Oh&sH, 


i.e., { S) is the intersection of all subgroups H of G such that S C H. Then the 
subgroup {S) of G is called the subgroup generated by S. If G — (S) , then S 
is called a set of generators for G. 
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If either S — (p or S = {e}, then { S) — {e}. Also, { G) — G. 

We now proceed to obtain a characterization of a subgroup generated by a 
nonempty subset in terms of the elements of the group. 

Let S = {H | H is a subgroup of G and S C if}, where 5 / Then 
(S, <) is a partially ordered set, where < denotes the set inclusion relation. 
In this poset, { S ) is the least element. Hence, (S) is the smallest subgroup 
of G which contains S. Since (S) is a subgroup of G, we must have for any 
si,..., s n G 5, the product s^ 1 ■ • • s^ n € { S ) , where e* = ±1 for i = 1,2, ..., n. 
Thus, if A denotes the set (s^ 1 ■ ■ ■ s^ 71 | 5, £ 5, e, = ±1, i = 1,2,... , n; n = 1,2, 
...}, then A C { S). Note that if s G S, then e = ss -1 G A. In the following 
theorem, we show that A — ( S ). Therefore, S does “generate” (S) in the sense 
of multiplying elements of S or their inverses together to build up the smallest 
subgroup containing S. 

Theorem 4.1.8 Let S be a nonempty subset of a group G. Then 

(S) = {s^ 1 • • • s® 71 | Si G 5, e z = ±1, i = 1,2,... ,n; n = 1,2,...}. 


Proof. Let 


4 = {*?••■« 


Cn 


G S', Ali ^ 1,2,..., n, 


n 



We have already noted that A C (S) . We show that ( S) C A by showing that 
A is a subgroup of G containing S. (Recall that (S) is the smallest subgroup 
of G containing S .) Let s € S. Then s = s l G A and so S C A. Let s{ x • ■ ■ s(^, 
t^ 1 ■ ■ ■ tq Q G A. Then 


Oi • ■ • X *? 1 •■•*?*) 1 = »i • • • sf?t q 9q • • -t x 91 eA. 


Thus, A is a subgroup of G by Theorem 4.1.3. Hence, (S) C A. ■ 


For a G G, we use the notation (a) rather than ({a}) to denote the subgroup 
of G generated by {a}. 

Corollary 4.1.9 Let G be a group and a G G. Then (a) = { a n \ n G Z}. 


Proof. By Theorem 4.1.8, we have (a) = {a ei ■ - • a em \ = ±1, i = 1,2, 
..., m\ m — 1,2, ...} = { a ei+---+e m | e . = ±1, i = 1,2, ..., m; m = 1,2,...} = 
{o n | n G Z}. I 

In additive notation, we would have (a) = { na \ n G Z}- 
Let n > 3. In Chapter 3, we proved that every element of A n is a product 
of 3-cycles (Theorem 3.1.23). In the following theorem, we conclude that A n 
is generated by the set of all 3-cycles. 

Theorem 4.1.10 Let n > 3. Then A n is generated by the set of all 3-cycles. 
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Proof. Since a 3-cycle is an even permutation, every 3-cycle is in A n . By 
Theorem 3.1.23, every element of A n is a product of 3-cycles. Hence, A n is 
generated by the set of all 3-cycles. ■ 

We now turn our attention to the product of subgroups. 

Definition 4.1.11 Let H and K be nonempty subsets of a group G. The prod¬ 
uct of H and K is defined to be the set 

HK = {hk \ he Hike K}. 

Let H 2 , H n be nonempty subsets of a group G. We define the 
product, H 1 H 2 ■ ■ • H n , of Hi, H 2 , ..., H n to be the set 

H 1 H 2 • • ■ H n = {hih 2 • • • hn I hi e Hi, i = 1,2,.. ., n}. 

Example 4.1.12 Consider the group of symmetries of the square. Let H = 
{ r 360i^i} an d K = {7~360j^}- Then H and K are subgroups of G. Now 

HK — {rseors60, r 36oh,dir^Qo, d\h} = {r^eo,h,di,r 9 o}- 

Since hd\ = r 270 ^ HK, HK is not closed under the binary operation. 
Hence, HK is not a subgroup of the symmetries of the square. Also, note that 


KH — {r 36 07'360T360^i, hr 36 o, hdi} = {r 36 o, d\, h, r 2 7o}, 


and, 

(HUK) - {r36o,r 90 ,ri8o,r270,h,v,di,d 2 }. 

Example 4.1.12 shows that in general the product of subgroups need not 
be a subgroup. In the following theorem, we give a necessary and sufficient 
condition for the product of subgroups to be a subgroup. 

Theorem 4.1.13 Let H and K be subgroups of a group G. Then HK is a 
subgroup of G if and only if HK = KH. 

Proof. Suppose HK is a subgroup of G. Let kh e KH, where h e H and 
k e K. Now h = he e HK and k = ek € HK. Since HK is a subgroup, it 
follows that kh € HK. Hence, KH C HK. On the other hand, let hk £ HK. 
Then ( hk )“ 1 6 HK and so ( hk)~ l = h\k\ for some hi E H and ki £ K. 
Thus, hk = ( hiki)~ l = kf l hf l £ KH. This implies that HK C KH. Hence, 
HK = KH. 
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Conversely, suppose IIK — KH. Let h\k\, h 2 k 2 6 HK. Now k^hf 1 £ 
KH — HK. This implies that k^Kf 1 = h^k^ for some /13 £ H and k$ £ K. 
Similarly, kih% = h^k^ for some £ H and k 4 £ K. Thus, 

(h\ki)(h 2 k 2 )~ l = hikik^ 1 ^ 1 

= hikih 3 ks 
= hih^k^k^ £ HK. 

Hence, HK is a subgroup of G by Theorem 4.1.3. ■ 

Corollary 4.1.14 If H and K are subgroups of a commutative group G , then 
HK is a, subgroup of G. 

Proof. Since G is commutative, HK — KH. The result now follows by 
Theorem 4.1.13 ■ 

The following theorem gives another necessary and sufficient condition for 
a product of subgroups to be a subgroup. 

Theorem 4.1.15 Let H and K be subgroups of a group G. Then HK is a 
subgroup of G if and only if HK — (H U K) . 

Proof. First suppose that HK is a subgroup of G. Let h £ H. Then 
h = he £ HK. Thus, H C HK. Similarly, K C HK. Hence, H U K C HK. 
Since (H U K) is the smallest subgroup of G containing H U K, it follows that 
(H U K) C HK. Let hk £ HK, where h £ H and k £ K. Since H C (H U K) 
and K C (# U Jf) , we have h,k £ (H U K). Thus, hk £ ( H U K) . This im¬ 
plies that HK C (H U K) . Hence, HK — (H U K) . The converse is immediate 
since (H U K) is a subgroup and HK = (H U K) . ■ 

Let G be a group. We denote by S(G) the set of all subgroups of G. 

Theorem 4.1.16 Let G be a group. Then (5(G), <) is a lattice, where < is 
set inclusion relation. 


Proof. Proceeding as in Example 1.4.5, we can show that the set inclusion 
relation is a partial order on 5(G). We now show that for all A, B £ 5(G), 
Ay B, A A B £ 5(G). Let A, B £ 5(G). By Theorem 4.1.6, A n B £ 5(G) 
and by the definition of 5(G), {A U B) £ 5(G). Now A, B C (All B) and so 
(All B) is an upper bound of A and B. Let G £ 5(G) be such that ACC 
and B C G. Then All B C C and so (All B) C G. Thus, (A U B) is the least 
upper bound of A and B, i.e., A V B = (A U B) . Hence, A V B £ 5(G). Next, 
we show that A A B = H. fl B, i.e., A(1 B is the greatest lower of A and B. 
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Now A fl B C A and A D B C B and so A fl B is a lower bound of A and B. 
Let D G S{G ), D C A, and D C B. Then D C A fl B and so A D B is the 
greatest lower bound of A and B , i.e., AAB = AC\B. Therefore, AAB G S(G). 
Consequently, (S(G), <) is a lattice. ■ 

The lattice ( S(G ), <) in Theorem 4.1.16 is called the subgroup lattice of 
the group G. Let (T, <) be a sublattice of ( S(G ), <), i.e., T C S(G) and (T, 
<) is a lattice. The poset diagram of (T, <) is called the lattice diagram. 
This lattice diagram will be useful in studying the interrelations among the 
subgroups of a group. Consider the following example. 

Example 4.1.17 (i) Let G = {1,-1, i, — i}. Then (G, *) is a group, where * 
is the usual multiplication of com.pl.ex numbers. Let 


$ - {{!}> {*> • 


The lattice diagram, of S is: 


G 

{i.-i} 

{i} 

(ii) Let G — {(1,1), ( 1 ,- 1 ), (—1,1), ( — 1 ,- 1 )}. Then (G, *) is a group, 
where * is defined by (a, 6 ) * (c, d) = ( ac,bd) for all (a, 6 ), (c, d) G G , where 
the multiplication ac and bd take place in the integers. Let E = {(1,1)}, 
Hi = {(1,1), (1,-1)}, H 2 = {(1,1), (-1,1)}, andHs = {(1,1), (-1,-1)}- 
Let S — { E , Hi, H 2 , H$, G}. The lattice diagram of S is: 


G 



We see from these examples that a lattice diagram gives a visual picture of 
how subgroups of a given group are related. 

Next, we consider an example of a group generated by two elements. We list 
several properties of the group. We ask the reader to verify these properties. 
We will study these types of groups in more detail in later chapters. 
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Example 4.1.18 Let G = (a,b ), where a 3 = e, b 2 = e, and ( ab ) 2 = e. Then 

(i) ab — ba~ l , ba = a~ l b, and a 2 b = ba. 

(ii) G is not commutative since ab ^ ba. 

(in) ba s = a~ s b for all positive integers s. 

(iv) By (i) and (in) 


a r b i a s U 


a r+s V ifi = 0 

a r- s b i+j = 


(v) Since a 3 = e — b 2 , every element of G is of the form. a T b l , 0 < r < 3, 
i = 0,1 by (iv). 

(vi) G = {e,a,b,ab,a 2 ,a 2 b}. Thus, |G| = 6. 

(vii) o(a) = 3 = o(a 2 ), o(b) = o(ab) = o(a 2 b) = 2. 

(viii) The only subgroups of G are {e}, (a) = (a 2 ), (b) , (ab) , ( a 2 b ), and 
G. 

G is called a dihedral group of degree 3 and is denoted by D 3 . In general, 
a. dihedral group 1 of degree n is D n — (a, b) , where (ab ) 2 = e, o(a) = n, and 
o (b) = 2. In Chapter 5, we consider a dihedral group of degree 4, D 4 , and study 
this group in detail. 


4.1.1 Worked-Out Exercises 

0 Exercise 1 Let if be a subgroup of a group G. Let g € G. Prove that 

(i) gHg~ l — {ghg~ l \ h 6 H] is a subgroup of G , 

(ii) \gHg _1 | = \H\. 

Solution: (i) We first show that gHg~ l ^ 0 and then use Theorem 4.1.3. 
Since e = geg~ l <E gHg ~ l , gHg~ l ^ 0. Let ghig~ l , gh 2 g~ 1 G gHg~ l . Then 

(gh\g~ l )(gh 2 g~ l )~ l = ghig^gh^ l g~ l = gh\}q' l g~ l G gHg~ l . 

Hence, gHg~ l is a subgroup of G. 

(ii) Let g G G. To prove that \gHg~ l \ = \H\, we show that there exists a 
one-one onto function of H onto gHg~ 1 . Define / : H —> gHg~ l by f(h ) = 
p/ig -1 for all h G H. Let h,h' G if. If h = h', then ghg~ l = gh'g ~ 1 , i.e., / is 
well defined. Also, ghg~ l G gHg~ l . Thus, / is a function of H into gHg~ 1 . 
Suppose /(/?,) = f(h'). Then ghg~ l = gh 1 g~ l . From this it follows that h = h!. 
This shows that / is one-one. To show / is onto gHg _1 , let a G gHg~ l . Then 
a — gbg _1 = f(b) for some b G H, namely, b = g~ l ag. Thus, / is onto gHg~ l . 

0 Exercise 2 Prove that S n is generated by {(1 2), (1 3), (14),..., (1 n)}. 
1 We show the existence of such groups in Chapter 7. 
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Solution: Let n be any permutation in S n . Then 7 r is a product of trans¬ 
positions. Thus, it is sufficient to show that if (? j) is any transposition in 
i < j, then 

(ij)e((l 2 ), (1 3), (1 4).(1 n)). 

This follows from the fact that (i j ) = (If ) 0 (1 j) ° (1 f). Hence, S n is generated 

by {(1 2), (1 3), (1 4),..., (1 n)}. 

{> Exercise 3 Find all subgroups of (Z, +). 

Solution: Let H be a subgroup of Z. Suppose H ^ {0}. Let a be a 
nonzero element of H. Then —a E H. Since either a or —a is a positive integer, 
H contains a positive integer. With the help of the principle of well-ordering, 
we can show that H contains a smallest positive integer. Let a be the smallest 
positive integer in H. We claim that H = {na \ n E Z}. 

Now na e H for all n E Z and so {na \ n E Z} C H. On the other hand, let 
b E H. By the division algorithm, there exist c and r in Z such that b = ca + r, 
where 0 < r < a. Suppose r ^ 0. Then r — b — ca E H. Thus, H contains a 
positive integer smaller than a, a contradiction. Hence, r — 0 and so b — ca E 
{na | n E Z}. This implies that H C {na \ n E z}. Thus, H — {na \ n E Z} for 
some a E Z. Also, for all n E Z, the set T = {nm j m E Z} = nZ is a subgroup 
of Z. Hence, nZ, n = 0, 1,2 ,... are the subgroups of Z. 


4.1.2 Exercises 

1. Prove that H is a subgroup of the group G, where 

(i) H = {[ 0 ], [ 2 ], [4], [ 6 ], [ 8 ], [10]}, G = Z 12 , 

(ii) H = {[0], [3], [ 6 ], [9]}, G = Z 12 

and where the group operation under consideration is + 12 . 


2. Let GL{ 2, R) denote the group of all nonsingular 2x2 matrices over R. 
Show that each of the following sets is a subgroup of GL{ 2,R). 


(i)S = 


a b 
c d 


ad — be — 1 


(ii) S = 

(iii) S = 

(iv) S = 


a 0 
0 a 


a ^ 0 


a b 
—b a 


either a or b is nonzero 


a b 
0 d 


ad 7^ 0 > . 
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(v) S = 


a —b 
b a 


a,b € R and a 2 + 6 2 7 ^ 0 


3. Show that the set H = {a + bi G C* | a 2 + b 2 = 1} is a subgroup of 
(C*, •), where • is the multiplication operation of complex numbers. 


4. Let G = {(a, 6 ) | a, b G R, b 7 ^ 0}. Prove that (G, *) is a noncommuta- 
tive group under the binary operation (a, b) * (c, d) = (a 4- be , bd) for all 
(a, b), (c, d) G G. 

(i) Let H = {(a, b) G G | a = 0}. Show that H is a subgroup of G. 

(ii) Let K = {( 0 , 6 ) G G \ b > 0}. Show that K is a subgroup of G. 

(iii) Let T = {(a, b) G G | b — 1 }. Show that T is a subgroup of G. 

(iv) Find all elements of order 2 in G. 


5. In S 3 , determine the set T — {x G S 3 | x 2 — e}. Is T a subgroup of S 3 ? 

6 . Determine the subgroup (4,6) in (Z,+). 

7. In (Z,+), determine the subgroup generated by {4,5}. 

8 . List the elements of the following subgroups. 


(i) 


12 3 4 
4 3 2 1 


12 3 4 
2 14 3 


in S 4 . 


(ii) (/i, v) in the symmetries of the square. 


9. Let a = (1 2 3 4) and b — (2 4) G 64 . 

(i) Find o(a) and °(b). 

(ii) Show that ba — a 3 6 = a~ 1 b. 

(iii) Find H = (a,b) in S 4 . 

(iv) Find \H\. 

10. Let G be a group generated by a, 6 such that 0 ( 6 ) = 2 , o(a) = 6 , and 
(a &) 2 = e. Show that 

(i) aba = b, 

(ii) (a 2 6) 2 = e, 

(iii) ba 2 b = a 4 , 

(iv) ba 3 b = a 3 . 


11 . Let G be a group. Prove that a nonempty subset H of G is a subgroup 
if and only if for all a, 6 G if, ab G H and a ' 1 G H. 

~Pu*uc- 
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12. Let G be a commutative group. Show that the set H of all elements of 
finite order is a subgroup of G. 

13. Let G be a group and a E G. Show that if a is the only element of order 
n in G, then a E Z(G). 

14. Show that Z(S n ) = {e} for all n > 3. 

15. Let G be a group and a E G. Let C(a) = {b E G \ ba = ab}. Prove that 
C(a) is a subgroup of G and that Z(G) = n a ^oO(a). C(a ) is called the 

centralizer of a in G. 

16. Prove that a group G cannot be written as the union of two proper 
subgroups. 

17. Let G be a group and H be a nonempty subset of G. 

(i) Show that if H is a subgroup of G, then HH = H. 

(ii) If H is finite and HH C H , prove that H is a subgroup of G. 

(iii) Give an example of a group G and a nonempty subset H of G such 
that HH C H, but H is not a subgroup of G. 

18. Let If be a subgroup of a group G. Prove that (H) — H. 

19. If A and B are subgroups of a group G , prove that A U B is a subgroup 
of G if and only if A C B or B C A. If C is also a subgroup of (7, does 
a similar necessary and sufficient condition hold for A U B U C to be a 
subgroup of G1 

20. Let G be a commutative group. If a and b are two distinct elements of 
G such that o(a) = 2 = o(5), show that |(a, 6)| =4. 

21. (i) Prove that S n is generated by {(1 2), (1 2 3 ■ • ■ n)}. 

(ii) Prove that S n is generated by {(1 2), (2 3), (3 4),..., (n — 1 n)}. 

22. Show that (Q, +) is not finitely generated. 

23. Let G be a group. Prove that if G is finite, then G has finitely many 
subgroups. 

24. Does there exist an infinite group with only a finite number of subgroups? 

25. For the following statements, write the proof if the statement is true; 
otherwise, give a counterexample. 

(i) All nontrivial subgroups of (Z,+) are infinite groups. 
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(ii) If A , B, and C are subgroups of a group G such that A\JB C C, then 
ABC C C. 

(iii) If G is a noncommutative group, then Z(G) = {e}. 

(iv) Let G be a group. If H is a nonempty subset of G such that a -1 € if 
for all a G if, then H is a subgroup of G. 

(v) There exists a proper subgroup A of (Z, +) such that A contains both 
2Z and 3Z. 

(vi) If if is a subgroup of (Q, +) such that Z C if, then H = Q. 

(vii) If if is a subgroup of (Q*, •) such that Z\{0} C if, then if = Q*. 


4.2 Cyclic Groups 

In the previous section, we introduced the notion of a subgroup generated by 
a set. Groups that are generated by a single element, called cyclic groups, are 
of special importance. As we shall see throughout the text, these groups play 
an important role in studying the structure of a group. In fact, all of Chapter 
9 revolves around these groups. Cyclic groups are easier to study than any 
other group. They have special properties, some of which we will discover in 
this section. 


Definition 4.2.1 A group G is called, a cyclic group if there exists a G G 
such that 


G = (a). 


We recall that (a) in Definition 4.2.1 is the set { a n \ n G Z} (Corollary 
4.1.9), 

Let G — (a) be a cyclic group and b,c G G. Then b — a n and c = a m for 
some n,m G Z. Now be = a n a m = a n+m = a m+n = a m a n = cb. This shows 
that G is commutative. Hence, every cyclic group is commutative. 


Example 4.2.2 (i) (Z, +) is a cyclic group since Z = (1) . 

(ii) ({na \ n G Z},+) (Example 2.1.4) is a cyclic group, where a is any 
fixed element of Z. 

(iii) (Z n ,+ n ) is a cyclic group since Z n = ([1]}. 
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Example 4.2.3 Let a be a symbol and n a positive integer. Define * by m.eans 
of the following operation table. 


* 

a 0 

a 1 

a 2 

■ • a n “ 2 

a 71-1 

a u 

a u 

a 1 

a 2 

a n ~ 2 

a 71 " 1 

a 1 

a 1 

a 2 

a 3 

a n ~ h 

a 0 

a 2 

1 

a 2 

a 3 

a 4 

a 0 

a 1 

a n ~ 2 

a n_2 

a 7 ^ 1 

a 0 

a n_4 

a 71-3 

a- 1 

a"' 1 

a 0 

a 1 

a n-3 

a n ~ 2 


Then ({a 0 , a 1 ,..., a n 1 }, *) is a cyclic group generated by a 1 . 

Example 4.2.4 Consider the set G = {e,a,6, c}. Define * on G by m.eans of 
the following operation table. 


* 

e a b c 

e 

e a b c 

a 

a e c b 

6 

b c e a 

c 

c b a e 


From, the multiplication table, it follows that ( G , *) is a commutative group. 
However, G is not a cyclic group since 

(e) = {e}, (a) = {e, a}, (b) = {e, &}, and (c) = {e, c } 

and each of these subgroups is properly contained in G. G is known as the 

Klein 4~group. 

The next theorem gives the exact description of a finite cyclic group. 

Theorem 4.2.5 Let (a) be a finite cyclic group of order n. Then (a) = 
{e, a, a 2 ,..., a" -1 }. 


Proof. By Corollary 4.1.9, (a) = { a 1 | i £ Z}. Since (a) is finite, there exist 
i, j £ Z (j > i) such that a 1 = a?. Thus, = e and j — i is positive. Let m 
be the smallest positive integer such that a m = e. Then for all integers i , j such 
that 0 < i < j < m, a 1 a J otherwise = e for some 0 < i < j < m, which 
contradicts the minimality of m. Hence, the elements of the set S = {e, o, a 2 , 
..., a m-1 } are distinct. Clearly S C (a) . Let a k £ (a) . By the division al¬ 
gorithm, there exist integers q , r such that k = qm + r, 0 < r < m. Thus, 
a k = a qm+r = ( a m ) q a T = ea T — a r £ S. Therefore, (a) C S. Thus, S = (a). 
Since the elements of S are distinct and (a) has order n, it must be the case 
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that m = n. ■ 

The following corollaries are immediate from the proof of Theorem 4.2.5. 
We omit the proofs. 

Corollary 4.2.6 Let (a) be a finite cyclic group. Then o (a) = |{a)|. ■ 

Corollary 4.2.7 A finite group G is a cyclic group if and only if there exists 
an element a £ G such that o(a) = \G\. ■ 

As stated in the beginning of this section, cyclic groups have special prop¬ 
erties. We now proceed to discover some of these properties. Subgroups of a 
cyclic group are themselves cyclic; this is proved in the next theorem. 

Theorem 4.2.8 Every subgroup of a cyclic group is cyclic. 

Proof. Let if be a subgroup of a cyclic group G = (a) . II H — {e}, then 
if = (e) and so if is cyclic. Suppose {e} C if. Then there exists b £ if such 
that b e. Since b £ G, we have b = a m for some integer m. Thus, m ^ 0 
since b ^ e. Since H is a group, a~ Tn = 6 _1 £ if. Now either m or —m is 
positive. Therefore, if contains at least one element which is a positive power 
of a. Let n be the smallest positive integer such that a n £ if. We now show 
that if = (a n ) . 

Since a n £ if, we must have ( a n ) C if. Let h £ if. Then h = a k for some 
integer k. By the division algorithm, there exist integers q , r such that k — 
nq + r, 0 < r < n. Since a n and a k £ if, we have a T = a k ~ nq = a k {a n )~ q £ if. 
However, if r > 0, we contradict the minimality of n. Therefore, r = 0 so that 
a k — ( a n ) q £ ( a n ) . Hence, if C (o n ) and so if = ( a n ) . Thus, if is cyclic. ■ 

Corollary 4.2.9 Let G = (a) be a cyclic group of order m , m > 1, and if be 
a proper subgroup of G. Then if = for som,e integer k such that k divides 
m and k > 1. Furthermore, j if | divides m. 

Proof. If if = {e}, then if — (a m ). Suppose that if ^ {e}. Let k be the 
smallest positive integer such that a k £ if. Then if = ( afc ) • Now there exist 
integers q and r such that m ~ qk + r, where 0 < r < k, and 

a r = Q rn—qk = a m a ~-qk = a ~qk = (( a *)-l)9 £ H 

The minimality of k implies that r — 0. Hence, m = qk and so k divides m. 
Since H ^ G, k > 1. Next, we show that j if j divides m. By Theorem 2.1.28(h), 
o(a k ) = gcd ^ y ~ q. As a result Corollary 4.2.6 implies that 

| = o (a k ) = q. 
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Since m = qk , we have g|m, i.e., |ff | divides m. ■ 

By Corollary 4.2.9, if G is a finite cyclic group and H is a subgroup of G, 
then | H | divides |G|. This is a special case of a more general result, called 
Lagrange’s theorem, which we will prove in the next section. 

Let G = (a) be an infinite cyclic group. Then o(a) is infinite and this 
implies that °(a k ) is infinite for any nonzero integer k. Thus, the order of any 
nonidentity element of G is infinite. Let H be a nontrivial subgroup of G. Then 
H is cyclic. Let H = (b). Then b ^ e and b € G and so o(6) is infinite. This in 
turn shows that \H\ is infinite. Thus, every nontrivial subgroup of an infinite 
cyclic group is infinite. 

Now let G = (a) be a finite cyclic group of order n and if be a proper 
subgroup of G. Then by Corollary 4.2.9, \H\ divides |G| .If H = {e}, then 
\H\ = 1 and if H = G, then \H\ = |G| and so \H\ divides \G\. Thus, the order 
of every subgroup of G divides the order of G. The following theorem shows 
that the converse of this result is also true for finite cyclic groups. 

Theorem 4.2.10 Let G be a finite cyclic group of order m. Then for every 
positive divisor d of m, there exists a unique subgroup of G of order d. 


Proof. Let G = (a) and d be a positive divisor of m. Since d\m, there exists 
k G Z such that m = kd. Now a k € G and by Theorem 2.1.28(h), 


o 



°( Q ) 

gcd (fc, m) 


m 

~k 


= d. 


Let H = ( afc ) • Then \H\ = o(a k ) = d. Thus, G has a subgroup of order d. 
Next, we establish that H is unique. 

Let if be a subgroup of order d. Let t be the smallest positive integer such 
that a t € K. Then K = {ah) . Since K is of order d, o(a*) — d by Corollary 
4.2.6. But o(a*) = gcd ^ m) by Theorem 2.1.28(u). Hence, d = gcd ^ m j, which 
implies that gcd(i,m) = ^ = k. This shows that k\t. Let t = kl for some l E Z. 
Now ah — a kl = ( a k ) l 6 H. Hence, K C H. Since |if | = |if | and H and K are 
finite, we have H = K. Thus, there exists a unique subgroup of order d. ■ 


4.2.1 Worked-Out Exercises 
<0 Exercise 1 (Q, +) is not cyclic. 

Solution: Suppose Q is cyclic. Then Q = for some ^ where 

p and q are relatively prime. Since ^ 6 Q, there exists n E Z, n 0 such 
that ^ by Corollary 4.1.9. This implies that ^ — n € Z, which is a 

contradiction. Thus, Q is not cyclic. 
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Exercise 2 Let G be a group such that \G\ = mn , m > 1, n > 1. Show that 
G has a nontrivial subgroup. 

Solution: First suppose that G is cyclic. Let G = (a) . Then o(a) = mn. 
Clearly o(a m ) = n. Let H = (a m ). Then H is a nontrivial subgroup of G. Now 
suppose that G is not cyclic. Then for all a E G, o(a) < mm by Exercise 26 
(page 79). Let ej- a E G and let H = (a) . Then H is a nontrivial subgroup of 
G. 

<0> Exercise 3 Let G be an infinite cyclic group generated by a. Show that 

(i) a r = a 1 if and only if r = t, where r, t E Z, 

(ii) G has exactly two generators. 

Solution: (i) Suppose a T = a 1 and r ^ t. Let r > t. Then a T ~ t = e. Thus, 
o (a) is finite, say, o(a) = n. Then G — {e, a, ..., a n-1 }, which is a contradiction 
since G is an infinite group. The converse is straightforward. 

(ii) Let G = (b ) for some b E G. Since a E G = (b) and b E G = (a) , a = b r 
and b = a 1 for some r, t E Z. Thus, a = b r = (a t ) r = a rt . Hence, by (i), rt — 1. 
This implies that either r = 1 = t or r = — 1 = t. Thus, either b = a or b = a -1 . 
Now from (i), a^a -1 . Therefore, G has exactly two generators. 

<0 Exercise 4 (i) Let G = (a) be a finite cyclic group of order n. Show that 
a k is a generator of G if and only if gcd(fc,n) = 1, where A; is a positive 
integer. 

(ii) Find all generators of Zio- 

Solution: (i) Suppose a k is a generator of G. Since |(T| = n, o (a k ) = n. 
But o(a fc ) = gcd ^ |n) ■ Hence, gcd ^ n) = n. Thus, gcd(/c,n) = 1. Conversely, 

suppose that gcd(/c,n) = 1. Then o(a fc ) — gcd ^ fc n ) = n. Hence, 

Since T G and |(7| = n, G = ^ a k ^ . 

(ii) Now Zio = ([1]) and |Zio| = 10. By (i), k[ 1] is a generator if and only if 
gcd(fc, 10) = 1, where 1 < k < 10. Now if k = 1, 3, 7, or 9, then gcd(fc, 10) = 1. 
Thus, the generators of Zio are 1 [1] = [1], 3[1] = [3], 7[1] = [7] and 9[1] = [9]. 

4.2.2 Exercises 

1. Let G = (a) be a cyclic group of order 30. Determine the following sub¬ 
groups. 

(i) <« 5 > ■ 

(ii) (a 2 ). 
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2. Let G be a cyclic group of order 30. Find the number of elements of order 
6 in G and also find the number of elements of order 5 in G. 

3. Prove that 1 and —1 are the only generators of Z. 

4. (i) Show that (R, +) is not cyclic. 

(ii) Show that (Q*, ■) is not cyclic. 

(iii) Show that (R*, •) is not cyclic. 

5. If (S' is a cyclic group of order n, show that the number of generators of 
G is 0 (n), where 0 is the Euler 0 -function. 

6 . Show that every proper subgroup of S 3 is cyclic. 

7. Give an example of a.noncyclic Abelian group all of whose proper sub¬ 
groups are cyclic. 

,v ,, I j O L-x, 7 y-c. 

8 . Let G be a group. Suppose that G has at most two nontrivial subgroups. 1 
Show that G is cyclic. 

9. Let G be a finite group. Show that if G has exactly one nontrivial sub¬ 
group, then order of G is p 2 for some prime p. 

10. Let G be a noncommutative group. Show that G has a nontrivial sub¬ 
group. 

11. Give an example of an infinite group which contains a nontrivial finite 
cyclic group. 

12. Show that there are cyclic subgroups of order 1,2,3, and 4 in S 4 , but £4 
does not contain any cyclic subgroup of order > 5. 

13. For the following statements, write the proof if the statement is true; 
otherwise, give a counterexample. 

(i) For every positive integer n, there exists a cyclic group of order n. 

(ii) Every proper subgroup of A 4 is cyclic. 

(iii) A 3 is a cyclic group. 

(iv) A 4 is a cyclic group. 

(v) All proper subgroups of (R, +) are cyclic. 
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4.3 Lagrange’s Theorem 

In the last section, we noted that the order of a subgroup of a finite cyclic group 
divides the order of the group (Corollary 4.2.9). We also remarked that this is 
a special case of a general result, called Lagrange’s theorem, i.e., the order of a 
subgroup of a finite group divides the order of the group. Lagrange proved this 
result in 1770, long before the creation of group theory, while working on the 
permutations of the roots of a polynomial equation. Lagrange’s theorem is a 
basic theorem of finite group theory and is considered by some to be the most 
important result in finite group theory. In this section, we prove this result. 
We begin with the following definition. 

Definition 4.3.1 Let H be a subgroup of a group G and a € G. The sets 
aH = {ah \ h 6 H} and Ha — {ha \ h € H] are called the left and right 
cosets of H in G , respectively. The element a is called a representative of aH 
and, Ha. 


If G is commutative, then of course aH = Ha. Observe that eH = H = He 
and that a = ae 6 aH and a — ea £ Ha. 


Example 4.3.2 Consider the symmetric group S 3 (Example 3.1.6). Then 


H = 



1 2 3 

2 3 1 


1 2 
3 1 



and 




are subgroups of S 3 . We now compute the left and right cosets of H in S 3 . The 
left cosets of H in S 3 are 


( 1 2 3 
\ 1 2 3 


H = 


1 2 3 

2 3 1 


H = 


12 3 
3 1 2 


H = H 


and 


12 3 

13 2 


H = 


1 2 3 
3 2 1 


H = 


1 2 3 

2 1 3 


H = 


1 2 3 
1 3 2 


1 2 3 

2 1 3 


1 2 3 
3 2 1 


and the right cosets of H in S 3 are 


H 


1 2 3 
1 2 3 


= H 


1 2 3 

2 3 1 


= H 


1 2 3 
3 1 2 


= H 


“PilHjc. TfeoiAewtoiLcn./ 



4.3. LAGRANGE’S THEOREM 


117 


and 


H 


1 2 3 
1 3 2 

1 2 3 
1 3 2 


= H 


1 2 3 
3 2 1 

1 2 3 

2 1 3 


= H 


1 2 3 

2 1 3 

1 2 3 ' ' 

3 2 1 


Thus, for all a 6 S 3 , aH = Ha. 

Next, we compute the left and right cosets of H in S 3 . The left cosets of 
H in S 3 are 

2 3 V = (! ? ? \H =H, 


1 2 3 


1 3 2 


and 


1 2 3 
3 2 1 


1 2 3 

2 1 3 


H = 


H = 


1 2 3 
3 1 2 


1 2 3 

2 3 1 


H = 


H = 


1 2 3 
3 2 1 


1 2 3 

2 1 3 


1 2 3 
3 1 2 


1 2 3 

2 3 1 


and the right cosets of H in S 3 are 


H 


1 2 3 
12 3 


= H 


1 2 3 
1 3 2 


- H 


H 


a,nd 


H 


1 

3 


1 

2 


2 

2 


2 

1 


3 

1 


3 

3 


= H 


= H 


1 

2 


1 

3 


2 

3 


2 

1 


3 

1 


3 

2 


1 

3 


1 

2 


2 

2 


2 

1 


3 

1 


3 

3 


1 2 3 

2 3 1 


1 2 3 
3 1 2 


We see that 


1 2 3 AH' 1 2 3 

3 1 2 j ^ ^ ^ I 3 1 2 


Thus, the left and right cosets of H' in S 3 are not the same. 


There are some interesting phenomena happening in the above example. 
We see that all left and right cosets of H in S 3 have the same number of 
elements, namely, 3; that there are the same number of distinct left cosets of 
H in S 3 as of right cosets, namely, 2; that the set of all left cosets and the set 
of all right cosets form partitions of S 3 ; and, finally, that 3 • 2 equals the order 
of S 3 . Similar statements hold for the subgroup H'. We show, in the results to 
follow, that these phenomena hold in general. 
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In the next few theorems, we prove some properties of left and right cosets 
of a subgroup which will eventually lead us to the proof of Lagrange’s theorem. 
The following theorem tells us when two left (right) cosets are equal. It is a 
result that is used often in the study of groups. 

Theorem 4.3.3 Let H be a subgroup of a. group G and a,b E G. Then 

(i) aH = bH if and only if b~ l a E H. 

(ii) Ha — Hb if and only if ab~ l E H. 

Proof. (i) Suppose aH — bH. Since a E aH and aH = bH , there exists 
h r E H such that a = bh'. This implies that b~ l a = h' E H. 

Conversely, suppose 6 _1 a E H. Then there exists h! E H such that b~ l a — 
h', i.e., a — bh!. Let ah E aH. Then ah = bh'h E bH. This implies that aH 
C bH. Next, we show that bH C aH. Now 6 _1 a — h' implies that ah'~ l = b. Let 
bh E bH. Then bh ~ ah' 1 h, E aH. Hence, bH C aH. Consequently, aH — bH. 
(ii) The proof is similar to (i). We leave it as an exercise. ■ 

Theorem 4.3.4 Let H be a subgroup of a group G. Then for all a, b E G, 
either aH — bH or aH D bH = <f> (i.e., two left, cosets are either equal or they 
are disjoint). 

Proof. Let a, b E G. Suppose that aH fl bH ^ cf. We wish to show that 
aH = bH. Since aH D bH ^ f>, there exists c E aH fl bH. Hence, c E aH and 
c E bH and so there exist h\, 6,2 E H such that c = ah\ and c = 6 / 7 , 2 . Thus, 
ah\ — 6/12 and from this, it follows that b~ l a = h 2 hf 1 . Therefore, b~ l a E H. 
By Theorem 4.3.3(i), aH = bH. ■ 

Corollary 4.3.5 Let H be a subgroup of a group G. Then {aH \ a E G } form,s 
a partition of G. 


Proof. Let V — {aH | a E G}, i.e., V is the set of all left cosets of H in G. 
By Theorem 4.3.4, for all aH, bH E V, either aH = bH or aH DbH = f>. Thus, 

V satisfies (i) of Definition 1.3.14. Since aH C G for all a E G, U a Hev a H C G. 
If a E G, then a E aH C U a H£P a H. Therefore, G C \J a Hev a H. Hence, G = 
U a HeV a H- This shows that V satisfies (ii) of Definition 1.3.14. Consequently, 

V is a partition of G. H 

Theorem 4.3.6 Let H be a subgroup of a group G. Then the elements of H 
are in one-one correspondence with the elements of a,ny left (right) coset, of H 
in G. 
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Proof. Let a be any element of G and aH be a left coset of H in G. To 
show that the elements of H are in one-one correspondence with the elements 
of aH, we show that there exists a one-one function of H onto aH. Define 
/://—> aH by f(h) = ah for all h G H. Let h,h\ G H. If h = h\, then 
ah — ah\, i.e., f(h) = f(h\). Hence, / is well defined. Suppose f(h) = /(hi). 
Then ah = ah\ and this implies that h = h\. Thus, f is a one-one function. 
To show / is onto aH, let ah G aH, where h G H. Then ah — f(h). Hence, / 
maps H onto aH. Similarly, we can show that the elements of H are in one-one 
correspondence with the elements of Ha. ■ 

The following corollary is immediate from Theorem 4.3.6. 

Corollary 4.3.7 Let H be a subgroup of a group G. Then for all a G G, 
\H\ = \aH\ = \Ha\ . ■ 


The next theorem says that there are the same number of left cosets as 
right cosets. 

Theorem 4.3.8 Let H be a subgroup of a group G. Then there is a one-one 
correspondence of the set of all left cosets of H in G onto the set of all right 
cosets of H in G. 

Proof. Let C = {aH \ a G G] be the set of all left cosets of H in G and 
1Z = {Ha | a G G] be the set of all right cosets of H in G. To establish a 
one-one correspondence between the elements of C and 7Z, we need to show 
the existence of a one-one function of C onto 7Z. 

Define / : C —» 1Z by 

f(aH) = Ha~ l 

for all aH € C. First note that Ha~ l € JZ for all a G G. Let aH,bH G C. 
Suppose aH — bH. Then by Theorem 4.3.3(i), b~ l a G H. This implies that 
6 -1 (a -1 ) -1 = b~ l a G H and so by Theorem 4.3.3(h), Hb~ l — Ha~ l . Thus, 
f(bH) = f{aH). Hence, / is well defined. To show / is one-one, suppose 
f(aH ) = f{bH). Then Ha -1 = Hb~^ and so a -1 (6 -1 ) -1 G H by Theorem 
4.3.3(h), i.e., a _1 6 G H. Therefore, b~ l a — (a' 1 ^) -1 G H and so aH = bH. 
Hence, / is one-one. Since for all Ha G 1Z, Ha = H(a~ 1 )~ 1 = f(a~ l H) and 
a~ l H G C, it follows that / is onto 1Z. Thus, / is a one-one function from C 
onto 7 Z. ■ 

Definition 4.3.9 Let H be a subgroup of a group G. Then the number of 
distinct left (or right) cosets, written [G : H), of H in G is called the index of 
H in G. 
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By Theorem 4.3.8, the number of left cosets and the number of right cosets 
of a subgroup H of a group G are the same. Thus, [G : H] is well defined. 

If G is finite, then of course \G : H} is finite. The following example is one, 
where G is infinite and [G : H] is finite. 


Example 4.3.10 Let n be a fixed positive integer. Consider the cyclic subgroup 
((n) , +) of (Z,+). Let k + (n) be a left coset of (n) in Z. By the division 
algorithm,, there exist integers q and r such that k — qn + r, where 0 < r < n. 
Then k — r = qn G (n) and so k + (n) = r + (n) by Theorem, 4.3.3. Suppose 
i + (n) = j + (n) , where 0 < i, j < n. Then i — j E (n) by Theorem, 4.3.3. This 
implies that n\(i — j) and so we must have i — j = 0 or i = j since 0 < i, j < n. 
Thus, the distinct left cosets of ( n) in Z are 0 + (n) , 1 + (n) , ..., n — 1 + (n) . 

We are now ready to prove Lagrange’s theorem. It is interesting to note 
that Lagrange proved the result for the symmetric group S n . Some credit Galois 
for proving the result in general. 

Theorem 4.3.11 (Lagrange) Let H be a subgroup of a finite group G. Then 
the order of H divides the order of G. In particular, 

\G\ = \G : H\\H\. 


Proof. Since G is a finite group, the number of left cosets of H in G is 
finite. Let {a\H, 02 #, . .., a r H} be the set of all distinct left cosets of H in 
G. Then by Corollary 4.3.5, G = U T i=zl aiH and a t H fl a 3 H = (f) for all i ^ j , 
1 < i,j < r. Hence, [G : H] = r and 

\G\ = \ ai H\ + \a 2 H\ + ■ ■ ■ + \a r H\. 

By Corollary 4.3.7, \H\ ~ \aiH\ for all i, 1 < i < r. Therefore, 

|C| = |/f| + |ff| + ... + |ff| 

"-V---' 

t times 

= r\H | 

= \g.h\\h\. 

Thus, the order of H divides the order of G. ■ 


Corollary 4.3.12 Let G be a group of finite order n. Then the order of a.ny 
element a of G divides n and. a n — e. 
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Proof. Let a E G and o(a) — k. Let H = (a) . Then by Corollary 4.2.6, 

|if | = | (a) | = o(a) = k. Hence, by Theorem 4.3.11, k divides n. Thus, there 
exists q E Z such that n — kq. Hence, a n = a kq = ( a k ) q =■ e q — e. ■ 

Let G be a finite group of order n and a E G. Then o(a) divides n by 
Corollary 4.3.12. Thus, to find o(a), we only need to check a k , where k is a 
positive divisor of n. For example, consider Z 20 and [ 6 ] E Z 20 - Now | Z 20 1 = 20 
and 1, 2, 4, 5, 10, and 20 are the only positive divisors of 20. Now 1 [ 6 ] = [ 6 ] 7 ^ [0], 
2[6] = [12] 7 ^ [0], 4[6] = [24] = [4] ^ [ 0 ], 5[6] = [30] = [10] ^ [0], and 
10[6] = [60] — [0]. Thus, o([6]) = 10. Hence, the above corollary can be used to 
find the order of an element in a finite group. 

Corollary 4.3.13 Let G be a group of prime order. Then G is cyclic. 


Proof. Since |G] > 2, there exists a E G such that a 7 ^ e. Let H — (a) . 
Then {e} C H and |if | divides |G|. But |G| is prime and so |if | = |G| . Since 
H C G and |if| = |G|, it follows that G — H. Therefore, G is cyclic. ■ 

G.H. Hardy (1877-1947) believed that no result of number theory would 
have a practical application. However, number theoretic results have recently 
been applied to cryptography, the study of secret codes. The following is such 
a result. It is known as Fermat’s little theorem. 

Theorem 4.3.14 (Fermat) Let p be a prim.e integer and a be an integer such 
that p does not divide a. Then p divides a p ~ l — 1, i.e., 

a p ~ l = p 1. 


Proof. Let U p = Z p \{0}. Then by Exercise 10 (page 78), U p is a group. 
Also, by Exercise 9 (page 78), \U p \ = p— 1. Let a be an integer such that p does 
not divide a. Then [a] is a nonzero element of Z p and so [a] E U p . Thus, by 
Corollary 4.3.12, [a] p_1 = [1], i.e., [a p_1 ] = [1]. Hence, a p_1 = p 1 by Exercise 
11 (page 30). ■ 


Let H and K be subgroups of a group G. If either H or K is infinite, then, 
of course, HK is infinite. Suppose H and K are both finite. We know that HK 
need not be a subgroup of G. Thus, \HK\ need not divide |G|. However, with 
the help of Lagrange’s theorem, we can determine \HK\ . This is a very useful 
result and we will use it very effectively in this text. In the next theorem, we 
determine \HK | when H and K are both finite. 


Theorem 4.3.15 Let H and K be finite subgroups of a group G. Then 


\HK\ - 


\H\\K\ 

\HnK\' 
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Proof. Let us write A = H fl K. Since H and K are subgroups of G ) A is 
a subgroup of G and since A C H, A is also a subgroup of H. By Lagrange’s 
theorem, \A\ divides \H\. Let n = j^j. Then [H : A\ = n and so A has n 
distinct left cosets in H. Let {x\A, X 2 A ,... ,x n A} be the set of all distinct left 
cosets of A in H. Then H — \Jf =l x^A. Since A C K, it follows that 

HK = (U 7 =1 x t A)K = U ? =1 XiK. 

We now show that X{K fl XjK = <f> if i ^ j. Suppose XiK n XjK ^ <fi for some 
i j. Then XjK = XiK. Thus, x~ l Xj £ K. Since x~ l Xj £ H , x ~ 1 Xj £ A 
and so XjA — X{A. This contradicts the assumption that x\A ,..., x n A are all 
distinct left cosets. Hence, x\K ,..., x n K are distinct left cosets of K . Also, 
\K\ = \xiK\ by Corollary 4.3.7 for alH = 1,2, ... ,n. Thus, 

| HK\ = \x 1 K\ + --- + \x n K\ 

= \K\ + -. + \K\ 

' ---' 

n times 

= n\K\ 

- 1 EMI 

- , \a\ 

- M B 

- \HDK ‘ 

The following corollary is an immediate consequence of the above theorem. 

Corollary 4.3.16 Let H and K be finite subgroups of a group G such that 
H OK = {e}. Then 

\HK\ = \H\\K\.U 

4.3.1 Worked-Out Exercises 

0 Exercise 1 Let H be a subgroup of a group G. Show that for all a £ G, 
aH — H if and only if a £ H. 

Solution: Let a £ G. Suppose aH = H. Then a = ae £ aH = H. 
Conversely, suppose that a £ H. Now for any h £ H, ah £ H. Hence, aH C H. 
Let h £ H. Then a~ l h £ H. Thus, h — a(a~ l h ) £ aH. Therefore, H C aif, 
proving that aH — H. 

0 Exercise 2 Let G be a noncyclic group of order p 2 ,pa prime integer. Show 
that the order of each nonidentity element is p. 

Solution: Let g £ G and g e. Now o (g) divides |G| = p 2 . Hence, 
o (g) — l,porp 2 . Since p/e, o (g) ^ 1. If o(g') — p 2 , then G contains an element 
g such that o(p) = |G| and this implies that G is cyclic, which contradicts the 
hypothesis. Hence, o(g) — p. 
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Exercise 3 Let G = {a, b, c, d} be a group, 
table for this group. 

a b c 


a 


Complete the following Cayley 
d 


b 

c 

d 


b 


b 


Solution: From the table, c 2 = b and db = b. Now db — b implies that 
d — e, the identity element of G. Since c 2 = b d, o(c) ^ 2. Hence, o(c) = 4. 
Thus, G is a cyclic group generated by c. Then G = {e , c, c 2 , c 3 }. Since d = e 
and c 2 = 6, it follows that c 3 = a. Hence, the Cayley table is 



a b c d 

a 

b c d a 

b 

c d a b 

c 

d a b c 

d 

abed 


Exercise 4 Let G be a finite nontrivial group. Suppose for all x E G, there 
exists y E G such that x — y 2 . Prove that the order of G is odd and 
conversely. 


Solution: Suppose G is of odd order. Then |G| = 2n + 1 for some positive 
integer n and for all x E G ) x 2n+1 — e. Now x 2n+1 = e implies x = x~ 2n = 
( x ~ n ) 2 = y 2 , where y — x~ n . Conversely, suppose |G| is not odd. Let |G| = 2 n 
and x E G. Then there exists y E G such that x = y 2 . Hence, x n = y 2n — e. 
Thus, for all x E G, x n = e. Suppose n is odd, say, n — 2m. + 1. Then 
x 2m,+1 — e for all x G G. By Worked-Out Exercise 5 (page 74), there exists 
z e G such that z ^ e and z 2 — e since |C?| is even. Hence, e = z 2m+l — 
zz 2m — z(z 2 ) m = ze = z, which is a contradiction. So n is even, say, n = 2m. 
Then x 2m = e for all x € G. As before, we can show that x m = e for all x € G 
and m is even. Continuing in this way, we can conclude that x 2 = e for all 
x E G. Let x E G. Then there exists y E G such that x = y 2 . Therefore, x = e. 
Thus, |G| = 1, which is a contradiction. Consequently, G is of odd order. 

0 Exercise 5 Let G be a group such that |G| > 1. Prove that G has only the 
trivial subgroups if and only if |G| is prime. 


Solution: Let |G| — p, p a prime. Let H be a subgroup of G. Then 
\H\ divides |G| . This implies that \H\ — 1 or p. Thus, H = {e} or H = G. 
Conversely, suppose that G has only the trivial subgroups. Let a E G be such 
that a e. Now (a) = {a n | a E Zj is a cyclic subgroup of G and (a) ^ {e}. 
Therefore, G = (a) . If G is infinite, then a T ^ a s for all r, s E Z, r ^ s. Hence, 
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{a 2n | n G Z} is a nontrivial subgroup of G, which is a contradiction. Thus, 
|G| is a finite cyclic group of order, say, m > 1. Suppose m is not prime. Then 
m — rs for some r, s E Z, 1 < r, s < m. Since r\ \G\ and G is cyclic, G has a 
cyclic subgroup H of order r. This contradicts the assumption that G has only 
the trivial subgroups. Hence, |G| is prime. 

0 Exercise 6 Let G be a group of order p n , p a prime. Show that G contains 
an element of order p. 

Solution: Let a G G, a ^ e. Then H = (a) is a cyclic subgroup of G. Now 
\H\ divides |G| = p n . Thus, \H\ = p m for some m £ Z, 0 < m, < n. Now H 
is a cyclic group of order p m . Hence, for every divisor d of p m , there exists a 
subgroup of order d. So for p, there exists a subgroup T of H such that \T\ = p. 
By Corollary 4.3.13, there exists b 6 T such that T = (b) and b is of order p. 
Hence, G contains an element of order p. 

Exercise 7 Let G be a finite commutative group such that G contains two 
distinct elements of order 2. Show that |G| is a multiple of 4. Also, show 
that this result need not be true if G is not commutative. 

Solution: Let a and b be two distinct elements of order 2. Let H — {e, a} 
and K = {e, 6}. Now H and K are subgroups of G. Since G is commutative, 
HK — {e,a,6, ab} is a subgroup of G of order 4. Now \HK\ = 4 divides |G| . 
Thus, |G| is a multiple of 4. 

The symmetric group S 3 is noncommutative, (12) and (13) are elements 
of S 3 , and each is of order 2. But 4 does not divide \Ss\ = 6. 

Exercise 8 Find all subgroups of S 3 and draw the lattice diagram of the 
subgroup lattice of S 3 . 

Solution: S 3 = {e, (1 2 ), (1 3), (2 3), (1 2 3), (1 3 2 )}. o(l 2 ) = 2 , o(l 
3) = 2, o(2 3) = 2, o(l 2 3) = 3, and o(l 3 2) = 3. Now {e}, {e, (1 2)}, {e, (1 
3)}, {e, (2 3)}, {e, (1 2 3), (1 3 2)}, and S 3 are subgroups of S 3 . Let H be a 
subgroup of S 3 . Now \H\ divides \G\ . Thus, \H\ = 1,2,3, or 6. If \H\ = 1, then 
H = {e}. If \H\ = 6, then H = S 3 . If \H\ — 2, then H is a cyclic group of order 
2. Hence, H is one of {e, (1 2)}, {e, (1 3)}, {e, (2 3)}. Suppose |JT| = 3. Then by 
Lagrange’s theorem, H has no subgroup of order 2. Thus, (1 2), (1 3), (2 3) ^ H. 
Therefore, e, (1 2 3), (1 3 2) € H. Also, {e, (1 2 3), (1 3 2 )} is a subgroup and 
so H = {e, (1 2 3), (1 3 2 )}. Hence, H 0 = {e}, H\ = {e, (1 2)}, H 2 = {e, (1 3)}, 
H 3 — {e, (2 3)}, H 4 = {e, (1 2 3), (1 3 2 )}, and S 3 are the only subgroups of S 3 . 
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Hi 



H a 


4.3.2 

1 . 


2 . 

3. 


4. 

5. 

6. 

7. 


8 . 

9. 

10 . 


Exercises 

In S 3 , 

(i) find all right cosets of H — {e, (2 3)}, 

(ii) find a subgroup B of G such that H(1 2 3) is a left coset of B. 

Find all right cosets of the subgroup 6Z in the group (Z, +). 

Let 

/I 2 3 4 \ fl 2 3 4 Wl 2 3 4 \ 1 

’ V 4 3 2 1 )• [2 1 4 3 J’( 3 4 1 2 ) }’ 

where e is the identity permutation. Show that H is a subgroup of S 4 . 
List all the left and right cosets of H in S 4 . 

Let H denote the subgroup {T 360 , h} of the group of symmetries of the 
square. List all the left and right cosets of H in G. 

Find all subgroups of the Klein 4-group. 

Find all subgroups of order 4 in S 4 . 

Let G = (a, 6, c, d} be a group. Complete the following Cayley table for 
this group. 



d a b c 

d 

d 

a 

c d 

b 


c 




Let G be a group and H and K be subgroups of G. Show that (HC\K)x = 
Hx D Kx for all x G G. 

Let G be a group and H and K be subgroups of G. Let a, b G G. Show 
that either Ha H Kb — (f) or Ha fl Kb = (H fl K)c for some c G G. 

(Poincare) Let G be a group and H and K be subgroups of G of finite 
indices. Show that H Pi K is of finite index. 
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11. Give an example of a group G and a subgroup H of G such that aH = bH, 
but Ha / Hb for some a, 6 € G. 

12. Let G be a group of order pq , where p and q are prime integers. Show 
that every proper subgroup of G is cyclic. 

13. Let H be a subgroup of a group G. Define a relation ~ on G by for all 
a, b £ G, a ~ b if and only if b~ l a £ H (i.e., if and only if aH = bH). 
Show that ~ is an equivalence relation on G and the equivalence classes 
of ~ are the cosets aH, a £ G. 

14. Let n > 1. Show that there exists a proper subgroup H of S n such that 
[S n : H] < n. 

15. Let H and K be subgroups of a finite group G such that \H\ > y/\G\ and 
\K\ > y/\G\. Show that \HnK\>l. 

16. Let |G| = pq, (p > q), where p and q are distinct primes. Show that G 
has at most one subgroup of order p. 

17. Let G be a group. If a subset A is a left coset of some subgroup of G, 
show that A is a right coset of some subgroup of G. 

18. Let G be a finite group and A and B be subgroups of G such that A C 
B C G. Prove that 

[G : A] = [G : B][B : A\. 

19. Let G be a group such that |G| < 200. Suppose G has subgroups of order 
25 and 35. Find the order of G. 


20. Let G be a group of order 35 and A and B be subgroups of G of order 5 

and 7, respectively. Show that G = AB. v '■ ' , 4- 

21. Let A and B be subgroups of a group G. If |A| = p, a prime integer, show 
that either An B — {e} or AC. B. 

22. Let H and K be subgroups of a group G. Define a relation ~ on G by 
for all a, b £ G, a ~ b if and only if b = hak for some h C H and k £ K. 

(i) Show that ~ is an equivalence relation on G. 

(ii) Let a £ G and [o] denote the equivalence class of a in G. Show that 

[a] = {hak \ h E H, k G K} — HaK. 

The set HaK is called a double coset of H and K in G. 
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(iii) If G is a finite group, prove that 


HaK 


W\\K\ 

HDaKa- 1 


for all a £ G. 

23. For the following, if the statement is true, then write the proof. Otherwise 
justify why the statement is false. 

(i) Every left coset of a subgroup of a group is also a right coset. 

(ii) The product of two left cosets of a subgroup of a group is also a left 
coset. 

(iii) There may exist a subgroup of order 12 in a group of order 40. 

(iv) Let G — (a) be a cyclic group of order 30. Then [G : (a 5 )] = 5. ® 

(v) Every proper subgroup of a group of order p 2 (p a prime) is cyclic. 

(vi) Let G be a group. If if is a subgroup of order p and K is a subgroup 
of order q, where p and q are distinct primes, then \HK\ = pq. 


4.4 Normal Subgroups and Quotient Groups 


In the previous section, we saw that a subgroup H of a group G induced two 
decompositions of G, one by left cosets and another by right cosets. In other 
words, if if is a subgroup of a group G, then G can be written as a disjoint 
union of distinct left (right) cosets of if in G. These two decompositions were 
first recognized by Galois in 1831 in the context of permutation groups. Galois 
called the decomposition “proper” if the two decompositions coincide, i.e., if 
left cosets are the same as right cosets. We call such a subgroup normal in 
our present-day terminology. Normal subgroups are the subject of this section. 
Galois showed how the solvability of a polynomial equation by means of radicals 
is related to the concept of a normal subgroup of the group of permutations 
of the roots and the group, called the quotient group, created by the normal 
subgroup. 

Perhaps the notion of a normal subgroup is one of the most innovative ideas 
in group theory. I.N. Herstein (1923-1988) remarked about normal subgroups 
that “It is a tribute to the genius of Galois that he recognized that those 
subgroups for which the left and right cosets coincide are distinguished ones. 
"Very often in mathematics the crucial problem is to recognize and to discover 
what are the relevant concepts; once this is accomplished the job may be more 
than half done.” 

Later C. Jordan defined normal subgroups without using the term normal 
as we define it in our present-day terminology. 
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We shall see in this text that normal subgroups play a crucial role in ob¬ 
taining structural results of groups. Let us now begin our study of normal 
subgroups. 


Definition 4.4.1 Let G be a group. A subgroup H of G is said to be a normal 
(or invariant) subgroup of G if aH — Ha for all a £ G. 

From the definition of a normal subgroup, it follows that for any group G, 
G and {e} are normal subgroups of G. 

If if is a normal subgroup of G, this does not always mean that ah — ha 
for all h £ H and for all a £ G as shown by the following example. 


Example 4.4.2 Recall Example 4.3.2. H is a normal subgroup of S 3 . Consider 
1 2 3 


h = 


2 3 1 


£ H. Then 


and 


Hence, 


even though 


( 1 2 3 
^ 1 3 2 


oh, — 


1 2 3\ 

3 2l) 


h o 


1 2 3 
13 2 


1 2 3 \ 

2 1 3 )' 


( 1 2 3 
V 1 3 2 


o h ^ ho 


12 3 ^ 

132 y 7 


/ 12 3 
^ 1 3 2 


H = H 


1 2 3\ 

1 3 2 J ‘ 


The following theorem gives a necessary and sufficient condition for a sub¬ 
group to be a normal subgroup. For a £ G, 0 7^ H C G, let aHa ~ 1 = { aha -1 
| h£H). 


Theorem 4.4.3 Let H be a subgroup of a group G. Then H is a normal 
subgroup of G if and only if for all a £ G, aHa~ l C H. 


Proof. First suppose that H is a normal subgroup of G. Let a £ G. We 
now show that aHa~ l C H. Let aha~ l £ aHa~ 1 , where h £ H. Since H is a 
normal subgroup of G , aH = Ha. Also, since ah £ aH, we have ah £ Ha and 
so ah — h'a for some h! £ H. Thus, aha~ l = h! £ H. Hence, aHa~ l C H. 

Conversely, suppose aHa~ l C H for all a £ G. Let a £ G. We show that 
aH — Ha. Let ah £ aH, where h £ H. Now aha~ l £ aHa~ l and so aha~ l £ H. 
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Thus, aha,- 1 = h' for some h' £ H. This implies that ah = h'a £ Ha. There¬ 
fore, aH C Ha. Similarly, we can show that Ha C aH. Hence, aH = Ha. 
Consequently, H is a normal subgroup of G. ■ 

There are several other criteria that can be used to test the normality of 
a subgroup. We consider some of these criteria in exercises at the end of this 
section. 

The following theorem describes some important properties of normal sub¬ 
groups. 

Theorem 4.4.4 Let H and K be norm,al subgroups of a group G. Then 

(i) H D K is a normal subgroup of G, 

(ii) HK = KH is a normal subgroup of G, 

(in) (HUK)= HK. 

Proof. (i) Since the intersection of subgroups is a subgroup, H D K is a 
subgroup of G. Let g £ G. Consider g{H D K)g~ 1 . Let gag -1 be any element 
of g(H fl K)g~ 1 , where a £ H fl K. Since a £ HdK, we have a £ H and a £ K. 
Hence, gag- 1 £ H and gag- 1 £ K. Thus, gag -1 £ H fl K. This shows that 
g^Hn^g- 1 C H fl K. Hence, H fl K is a normal subgroup by Theorem 4.4.3. 

(ii) First we show that HK = KH. Let hk £ HK, where h £ H and k £ K. 
Since If is a normal subgroup of G and h £ G, we have h,K = Kh. Thus, 
hk £ h,K = Kh. Since Kh C KH, we have hk £ KH. Hence, HK C KH. 
Similarly, KH C HK and so HK — KH. Since H and K are subgroups and 
HK — KH, HK is a subgroup of G by Theorem 4.1.13. To show that HK is 
a normal subgroup, let g £ G. Then gHg- 1 C H and gKg- 1 C K since H and 
K are normal subgroups. Now 

g(HK)g~ l = g(Hg-'gK)g 

= (gHg-^gKg- 1 ) 

C HK. 

Therefore, HK is a normal subgroup of G by Theorem 4.4.3. 

(iii) By (ii), HK is a subgroup of G. Hence, by Theorem 4.1.15, 

HK = (H U K ). M 

We know that if H and K are subgroups of a group G, then HK need 
not be a subgroup of G (Example 4.1.12). By the above theorem, if H and K 
are normal subgroups, then HK is & normal subgroup and hence a subgroup. 
However, in order to show that HK is a subgroup, we only need either H or 
if to be a normal subgroup. We consider one of these situations in Exercise 
13 (page 137). 


“Public. 
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In Theorem 4.1.16, we proved that the set of all subgroups of a group G is 
a lattice. In the next theorem, we prove that the set of all .normal subgroups 
of a group G is a modular lattice. 

Corollary 4.4.5 Let Af{G) denote the set of all normal subgroups of a group 
G. Then (J\f(G), <) is a modular lattice, where < is the set inclusion relation. 


Proof. Proceeding as in Theorem 4.1.16, we can show that (A/"(G), <) is 
a lattice, where H A K — H D K and H V K = (H U K) = HK for all 
H,K £ N{G). Let H,K,L £ Af(G) be such that H < L. We now show that 
H(KDL) = HK Pi L. Since H C HK and H C L, we find that H C tfKnL. 
Also, K D L C K C HK and KnL C L. As a result ifnL C HWnL, showing 
that H(K fl L) C HK n L. Let a € HK Pi L. Then a € LTiF and a £ L. Thus, 
a — hk for some h £ H and k £ K. This implies that E L and so 

k £ KC\L. Hence, a £ H(KDL), which implies that HKnL C H(KDL). Con¬ 
sequently, we must have H(K DL) = HKnL, i.e., iLV (K A L) — (H V K) A L. 
Hence, (J\f(G), <) is a modular lattice. ■ 


We now focus our attention on the study of quotient groups. First, let us 
consider the following example. 


Example 4.4.6 Consider the subgroup H' of Example 4.3.2. Now H' is not a 
normal subgroup of S 3 . Let 63 / H' be the set of all left cosets of H' in S 3 . Now 
let us try to define a binary opera.ti.on * on S 3 /H'. The natural way would be 
to define (jriH') * ( 7:2 H ') to be (71-1 o7T2 )H'. Now 


and 


However, 


( 1 2 3 
\ 3 2 1 


H' - 


1 2 3 
3 1 2 


H' 


( 1 2 3 
V 2 1 3 


H' = 


1 2 3 

2 3 1 


H'. 




1 2 3 
3 2 1 



1 2 3 
3 1 2 


H 1 


and 


1 2 3 

2 3 1 



1 2 3 
3 1 2 



1 2 3 
1 2 3 


H'. 


( 12 3 

\ 3 1 2 


H'^ 


1 2 3 
1 2 3 


H f 




JPiLfuc. 


Since 
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* is not well defined. That * is not well defined, is due to the fact that H f is 
not a normal subgroup of S 3 . 

Theorem 4.4.7 Let H be a norm.al subgroup of a group G. Denote the set 
of all left cosets {aH \ a £ G} by G/H and define * on G/H by for all aH, 
bH £ G/H, 

(aH) * (bH) = abH. 

Then ( G/H ,*) is a group. 

Proof. First we show that * is well defined. Let aH, bH, a'H, b'H £ G/H 
and suppose (aH, bH) = (a'H, b'H). Then aH = a'H and bH = b'H. We 
need to show that aH * bH = a'H * b'H or abH = a'b'H. Now aH = a'H and 
bH — b'H imply that a — a'hi and b = b'h 2 for some hi, h ,2 £ H. Thus, 

(a'b')~ l (ab) = b'~ 1 a'~ 1 ab 

— b'~ 1 a'~ 1 a'hib'h 2 

= b'~ l hib'h 2 . 

Since H is a normal subgroup and hi £ H, we have b'~ l hib'h 2 = (b'~ l hib')h 2 £ 
H and so (a'b')~ l (ab) £ H. Hence, abH = a'b'H by Theorem 4.3.3(i). Thus, * 
is well defined and so (G/H, *) is a mathematical system. 

Next, we show that * is associative. Let aH, bH, cH £ G/H. Now (aH) * 
[(bH) * ( cH )] = (aH) * (bcH) = a(bc)H = (ab)cH = (abH) * (cH) = [(aH) * 
(bH)] * (cH). Hence, * is associative. Now eH £ G/H and 

(aH) * (eH) = aeH = aH = eaH = (eH) * (aH) 

for all aH £ G/H. Therefore, eH is the identity of G/H. Also, for all aH £ 
G/H, a~ l H £ G/H and 

(aH) * (a~ l H) = aa~ l H = eH = a~ l aH = (a~ l H) * (aH). 

Thus, for all aH £ G/H, a~ l H is the inverse of aH. Consequently, (G/H, *) is 
a group. ■ 

Definition 4.4.8 Let G be a group and H be a normal subgroup of G. The 
group G/H is called the quotient group of G by H. 

Example 4.4.9 Consider the subgroup ((n) ,+) of the group (Z,+), where n 
is a fixed, positive integer. Since Z is commutative, (n) is a norm.al subgroup 
ofZ (Exercise 15, page 137). Hence, (Z/ (n) , +) is a group, where 

(a + (n)) + (b + (n)) — (a + b) + (n) 

for all a+ (n) , 6+ (n) £ Z/ (n) . In Example 4.3.10, we determined the distinct 
left, cosets of (n) in Z. We found that 

Z/ (n) = {0 + (n) , 1 + (n), 2 T (n),..., n — 1 + (n)}. 

PuJlc. TfeatAewtafLca/ 
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Example 4.4.10 Consider the normal subgroup H of S 3 of Example 4.4.2. 
Since |£ 3 1 = 6 and \H\ — 3, [63 : H] = 2 by Lagrange’s theorem,. Now \Ss/H\ = 
[S 3 : H] = 2 and for all h E H, hH = H. Thus, eH = H, (1 2 3 )H = H and 
(1 3 2 )H = H. We ha,ve shown in Example 4.3.2 that (2 3 )H = (1 3 )H = (1 
2 )H. Thus, 

S 3 /H = {if, (2 3 )H} . 

We also note that S 3 /H is cyclic and (2 3)H is a generator for S 3 /H. 

Example 4.4.11 Consider Z& and let H = {[0], [4]}. Then H is a norm,a,l 
subgroup of Zq. Now \H\ = 2 and \Z&\ = 8 . Thus, \Z$/H\ = = 4. Hence, 

Zq/H has four elements. Now 

[0]+H = H=[4}+ H, 

[1] + ff = {[!]. [5]} = [5] + if, 

[ 2 ] + if = {[ 2 ], [ 6 ]} = [ 6 ] + if, 

and 

[3] + if = {[3], [7]} = (7] + if. 

Hence, Z 8 /if = {[0] + if, [1] + H , [2] + H, [3] + if}. 

Example 4.4.12 Consider Z 4 x Zq, the direct product 0 /Z 4 and Zq. Let 

H = (([0], [1])) = {([0], [0]), «0], [1]), ([0], [ 2 ]), ([0], [3]), ([0], [4]), ([0], [5])}. 

Then H is a subgroup of Z 4 x Zq and since Z 4 x Zq is commutative, H is a 

norm,a,l subgroup of Z 4 x Zq. Now [Z 4 x Zq\ = 24 and \H\ = 6 . Hence, 

|(Z 4 xZ 6 )/ii| = n^ = 4. 

Thus, (Z 4 x Zq)/H has four elements. Since for all [n] E Zq, ([0], [n]) E H, 

we have for all [n] E Zq, ([0],[n]) + H — H. Let ([m], [n]) E Z 4 x Zq. Then 

([m], [n]) = ([m], [0]) + ([0], [n]) and from, this, it follows that ([m], [n]) + H — 
([m], [0])+ff. Let us now compute ([m], [0])+i/ form = 0,1,2, 3. Now ([0], [0])+ 
H = H, 

([1], [0]) +H = {([1], [0]), ([1], [1]), ([1], [2]), ([1], [3]), «1], [4]), ([1], (5])}, 

([2], [0]) + H = {([2], [0]), ([2], [1]), ([2], [2]), ([2], [3]), ([2], [4]), ([2], [5])}, 

and 

([3], [0]) + if = {([3], [0]), ([3], (1]), ([3], [2]), ([3], [3]), ([3], [4]),([31,[5])}. 

From, above, we see that ([0], [0])+//, ([1], [0]) + //”, ([ 2 ], [0])+ff, and ([3], [0 ])+H 
are all distinct. Hence, 

(Z 4 x Z q)/H - {([0], [0]) + H, ([ 1 ], [0]) + H, ([ 2 ], [0]) + H, ([3], [0]) + H}. 

~Pu*uc- TfeoiAewtoi^Lca/ 
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Groups of the type given in the next definition are building blocks for all 
groups. They are important because they help to determine the structure of 
groups. We will discuss this in more detail when we introduce the concept of 
a composition series of a group (Chapter 8 ). 

Definition 4.4.13 Let G be a group. Then G is called simple if G / {e} and 
the only normal subgroups of G are {e} and G. 

The only simple commutative groups are given in the next example. We 
will determine the simple groups of order < 60 (in Section 7.4). 

Example 4.4.14 Let G be a cyclic group of order p, p a prime. Since the only 
subgroups of G are {e} and G , G is simple. 

We now proceed to establish the simplicity of A n , n > 5. Thus, there is a 
large class of simple groups. 

Lemma 4.4.15 Let H be a norm,al subgroup of A n , n > 5. If H contains a 
3 -cycle, then H — A n . 

Proof. Suppose H contains a 3-cycle, say, (a b c) E H. Let (u v w) E A n 
and let 7r E S n be such that tt(o) — u, n(b) = v , and 7r(c) = w. Now 7r o (a b 
c) o tt ~ 1 = (u v w). If tt E A n , then (u v w) E H. Suppose 7r ^ A n . Then 7T is 
an odd permutation. Since n > 5, there exist d, f E I n such that d and / are 
distinct from a, b and c. Then tt o (d f) £ A n . Now (u v w) = tt o (a b c) o7r _1 = 
7T o (a b c) o (d f) o (d f)~ l ° 7T~ l = tt o (d /) o (a b c) o (d f)^ 1 o tt~ 1 — (7T o (d 
/)) o (a b c) o (tt o (d f))~ l G H. Thus, H contains all 3-cycles. Since A n is 
generated by the set of all 3-cycles, H = A n . ■ 

Theorem 4.4.16 Let H be a norm,al subgroup of A n , n > 5. If H contains a 
product of two disjoint transpositions, then H — A n . 


Proof. Suppose (a b) o (c d) G H , where (a b) and (c d) are disjoint trans¬ 
positions. Let w G I n be such that w £ {a,b,c,d}. Let tt = (c d w). Since 7r is 
a 3-cycle, tt E A n . Since id is a normal subgroup of A n , we have it o fab) o { c 
d) o 7T —1 E H. But 

tt o (o b) o (c d) o 7 r -1 — (d, w) o (a 6) 
and so (d w ) o (a 6) E H. Since H is a subgroup, 

(c d w) = (a 6) o (c d) o (d uf) o (a 6) G H. 

Hence, H contains a 3-cycle and so by Lemma 4.4.15, H — A n . ■ 

Theorem 4.4.17 A n is simple if n > 5. 

“PuJuc. 7feaiAe»tafLca/ 
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Proof. Let H be a normal subgroup of A n and H ^ {e}. Let 7r G id, 7r ^ e 
be a permutation that moves the smallest number of elements, say, m. Then 
m > 3. We claim that m ■= 3, in which case the result follows by Lemma 4.4.15. 
Suppose m > 3. Write 7r = 7Ti o 772 o ■ ■ • o as a product of disjoint cycles. 

Suppose that 7fj is a transposition for all 2 = 1,2, ..., k. Then k > 2. Let 
7rj = (a b ) and 7r2 = (c d). Let f £ I n be such that / ^ {a, 6 , c, d} and let a — (c 
d /). Since a £ A n and id is a normal subgroup of A n , n' — n' 1 oaoTroa^ 1 £ H. 
Clearly n' (a) = a and 7r / (6) — b. If u £ I n and n ^ {a, 6, c, d,/} is such that 
7 r(u) = u, then ir'(u ) = u. Since 7r'(/) = c, 7r' ^ e. Thus, 7r' £ H, it' e, 
and 7r' moves fewer elements than i r, which is a contradiction. Hence, for some 
i, 1 < i < k, 7Ti is a cycle of length > 3. Since disjoint cycles commute, by 
renumbering if necessary, we may assume that i = 1. Then i rj = (o 6 c •■■). 
If m = 4, then 7r is a cycle of length of 4 and hence an odd permutation, 
a contradiction. Thus, m > 5. Hence, 7r moves at least five elements. Let d, 
f £ I n and d, / ^ {a, 6, c}. Let a = (c d /). As before, 7r' = 7r _1 octottoct^ 1 G H. 
Since Tr^b) = 7r _1 (d) ^ 6, 7r' ^ e. Now for any u ^ {a, 6, c, d, /}, if tt(u) = u, 
then 7 r'(u) — u. Clearly 7r '(a) = a. Hence, tt' moves fewer elements than 7r, 
which is again a contradiction. Hence, m. = 3. B 


4.4.1 Worked-Out Exercises 

0 Exercise 1 Let id be a subgroup of a group G. Then W = P\ g ^G 9 Hg~ l is 
a normal subgroup of G. 

Solution: By Worked-Out Exercise 1 (page 106), gHg~ 1 is a subgroup 
of G for all g £ G. Since the intersection of subgroups is a subgroup, W is a 
subgroup of G. Let x £ G, w £ W. Then w £ gHg~ l for all g £ G. We show 
that xwx£ gHg~ l for all g £ G, which in turn will yield that xwx~ l £ W. 
Let g £ G. 

Let us work our way backward and suppose xwx~ l £ gHg~ l . Then xwx~ l — 
ghg~ l for some h £ H. Thus, g~ l xw x~ l g = h £ id. This implies that 

(g~ 1 x)w(g~ 1 x)~ 1 £ H. 

Set y — x~ l g. Then g = xy. Hence, in order to show that xwx' 1 £ gHg' 1 for 
a given g £ G, first we need to find y £ G such that g = xy. Since g = x(x' 1 g ), 
we can choose y = x' 1 g. 

So there exists y £ G such that g = xy. Since y £ G, we have w £ yHy -1 
and so w — yhy~ l for some h £ H. Therefore, xwx' 1 = x(yhy~ 1 )x~~ 1 = 
xyhy~ 1 x~ 1 — ( xy)h(xy ) -1 = ghg' 1 £ gHg~ 1 . Since g £ G was arbitrary, 
xwx' 1 £ gHg' 1 for all g £ G. Consequently, W is a normal subgroup of G. 


Exercise 2 Let id be a subgroup of G. 

~PuAuc- TfeoiAewtadtcn./ 
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(i) If x 2 £ H for all x £ G, prove that H is a normal subgroup of G and 
G/H is commutative. 

(ii) If [G : H] — 2, prove that H is a normal subgroup of G. 

Solution: (i) Let g £ G and h £ H. Consider ghg~ l and note that 

ghg~ l = ( gh) 2 h~ l g ~ 2 . 

Now h~ l £ H and by our hypothesis ( gh ) 2 , < 7~ 2 € H. This implies that ghg~ l £ 
if, which in turn shows that gHg~ l C H. Hence, H is a normal subgroup of 
G. To show that G/H is commutative, let xH,yH £ G/H. We wish to show 
that xHyH = yHxH or xyH = yxH or (yx)^ 1 (xy) £ H. Consider (; yx)~ l {xy ). 
Now 

(yx)~ 1 (xy) = (x^ l y~ 1 )(xy) = (x~ l y~ 1 ) 2 (yxy~ 1 ) 2 y 2 . 

Since a 2 £ H for all a £ G, it follows that (x~ 1 y~ 1 ) 2 {yxy~ 1 ) 2 y 2 £ H and so 
(yx)~^(xy) £ H. Thus, G/H is commutative. 

(ii) We prove that H is a normal subgroup of G first by showing that 
x 2 £ H for all x £ G and then by using (i). Suppose there exists x £ G such 
that x 2 H. Then x £ H and so H and xH are distinct left cosets of H in 
G. Since [G : H] = 2 , it follows that G/H = {H, xH}. Hence, G = H U xH. 
This implies that x 2 £ H U xH. Since x 2 (/ H , we must have x 2 £ xH . Hence, 
x 2 = xh for some h £ H. But then x = h £ H, which is a contradiction. Hence, 
x 2 £ H for all x £ G. By (i), H is a normal subgroup of G. 

Exercise 3 Let G be a group such that every cyclic subgroup of G is a normal 
subgroup of G. Prove that every subgroup of G is a normal subgroup of 
G. 

Solution: Let if be a subgroup of G. Let g £ G and a £ H. Then 
g~ 1 ag £ (a) C H. Hence, H is normal in G. 

Exercise 4 Let if be a proper subgroup of G such that for all x,y £ G\H , 
xy £ if. Prove that if is a normal subgroup of G. 

Solution: Let x £ G\H. Then a; -1 6 G\H. Let y £ H. Then xy £ G\H. 
Thus, xy, x" 1 G G\H. Hence, xyx -1 £ if. Therefore, if is a normal subgroup 
of G. 

<0 Exercise 5 Let G be a group and {Ni \ i £ li} be a family of proper normal 
subgroups of G. Suppose G = UiNi and Ni D Nj = {e} for i 7 ^ j. Prove 
that G is commutative. 
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Solution: Let x,y £ G. Then there exist i and j such that x £ Ni and 
y £ Nj. If i ^ j, then since NiC\Nj = {e}, xy = yx (Exercise 12, page 137). Let 
i = j. Now there exists z £ G such that z ^ Ni. Then zx £ Ni. Hence, zx £ Ni 
for some l ^ i and so (zx)y = y(zx). Thus, z{xy) = ( zx)y = y(zx) = ( yz)x = 
(zy)x — z(yx). This implies that xy = yx. Consequently, G is commutative. 

Exercise 6 Let H be a subgroup of a group G. Suppose that the product of 
two left cosets of H in G is again a left coset of H in G: Prove that H is 
a normal subgroup of G. 

Solution: Let g £ G. Then gHg~ l H = tH for some t £ G. Thus, e — 
geg~ l e £ tH. Hence, e — th for some h £ H. Thus, t = h~ l £ H so that 
tH = H. Now gHg~ l C gHg~ l H = H. Therefore, H is a normal subgroup of 
G. 

<0> Exercise 7 Let G be a group. Show that if G/Z{G ) is cyclic, then G is 
commutative. 

Solution: Write Z — Z(G). Let GfZ — (gZ). Let a,b £ G. Then aZ, bZ £ 
GfZ. Hence, aZ — g n Z and bZ = g ra Z for some n,m £ Z. Then a £ g n Z and 
b £ g^Z. Thus, a = g n d and b - g^h for some d,h £ Z. Now ab = g n dg m h = 
g n g Tn dh (since d £ Z) = g n+Tn hd (since h £ Z) = g Tn g n hd = g rn hg n d = ba. 
Hence, G is commutative. 

4.4.2 Exercises 

1. Let 

1 2 3 4 \ ( 1 2 3 4 W 1 2 3 4 \ \ 

4 3 2 1 )’ b 2 1 4 3 Jd 3 4 1 2 J )’ 

where e is the identity permutation. Determine whether or not H is a 
normal subgroup of S 4 . 

2. Let H denote the subgroup {r 36 o, h } of the group of symmetries of the 
square. Determine whether or not H is a normal subgroup of G. 

3. Let G be a group and H be a subgroup of G. Show that H is normal if 
and only if ghg~ l £ H for all g £ G, h £ H. 

4. Let G be a group and H be a subgroup of G. If for all a,b £ G, ab £ H 
implies ba £ H , prove that H is a normal subgroup of G. 

5. Let H be a proper subgroup of a group G and a £ G, a H. Suppose 
that for all b £ G, either b £ H or Ha = Hb. Show that H is a normal 
subgroup of G. 
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6 . Let G be a group. Prove that Z(G) is a normal subgroup of G. 

7. Let G be a group. Let H be a subgroup of G such that H C Z(G). Show 
that if G/H is cyclic, then G = Z(G ), i.e., G is commutative. 

8 . Let H and K be subgroups of a group G such that H is a normal subgroup 
of G. Prove that H fl K is a normal subgroup of K. 

9. Determine the quotient groups of 

(i) (E, +) in (Z, +), 

(ii) (Z, +) in (Q, +), 

(hi) (([4]) ,+ 12 ) in (Zi 2 ,+i 2 )- 

10. Let H be a normal subgroup of a group G. Prove that if G is commuta¬ 
tive, then so is the quotient group G/H. 

11 . Let H be a nonempty subset of a group G. The set N(H ) = {a G G \ 
aHa~ l — H] is called the normalizer of H in G. 

(i) Prove that N(H) is a subgroup of G. 

Suppose if is a subgroup of G. 

(ii) Prove that H is normal in G if and only if N(H) — G. 

(iii) Prove that H is normal in TV (if). 

(iv) Prove that N(H) is the largest subgroup of G in which H is normal, 
i.e., if H is normal in a subgroup K of G , then K C iV(ff). 

12. Let H and K be normal subgroups of a group G. If H n K — {e}, prove 
that hk = kh for all h H and k E K. 

13. Let G be a group. Let ii be a subgroup of G and K be a normal subgroup 
of G. Prove that HK is a subgroup of G. 

14. Give an example of a noncommutative group in which every subgroup is 
normal. 

15. Show that every subgroup of a commutative group is normal. 

16. Let if be a normal subgroup of a group G such that \H\ — 2. Show that 
H C Z(G). 

17. Show that if H is the only subgroup of order n in a group G, then H is 
a normal subgroup of G. 

18. Let K = {e, (1 2) o (3 4), (1 4) o (3 2), (1 3) o (2 4)}. 

(i) Show that K is the only subgroup of order 4 in A 4 . 

(ii) Show that ii is a normal subgroup of A 4 . 
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19. Show that A 4 has no subgroup of order 6. 

20. Find all subgroups of A 4 . Draw the subgroup lattice diagram. Is this 
lattice a modular lattice? 

21 . Let G be a commutative group. Show that G is simple if and only if G 
is of prime order. 

22 . Let G be a group. An equivalence relation p on G is called a congruence 
relation if 

for all a, 6, c E G , apb implies that capcb and acpbc. 

Let if be a normal subgroup of G. Define the relation pn on G by 
for all a, b E G, apnb if and only if a~ l b E H. 

Prove that 

(i) ph is a congruence relation on G , 

(ii) the ph class apn = {b E G | apnb} is the left coset aH , 

(hi) H = ep H - 

23. Let if be a subgroup of a group G. Define a relation pH on G by pn — 
{(a, b) E G x G | a~ l b G if}. Show that if pn is a congruence relation, 
then if is a normal subgroup of G. 

24. Let p be a congruence relation on a group G. Show that there exists a 
normal subgroup if of G such that p = {(a, b) G G x G | a -1 6 G if}. 

25. For the following statements, write the proof if the statement is true; 
otherwise, give a counterexample. 

(i) A subgroup H of a group G is a normal subgroup if and only if every 
right coset of if is also a left coset. 

(ii) If A , B and C are normal subgroups of a group G, then A(B fl C ) is 
a normal subgroup of G. 

(iii) If A is a normal subgroup of a finite group G, then [G : A\ = 2. 

(iv) Every commutative subgroup of a group G is a normal subgroup of 
G. 

(v) If G is a group of order 2 p, p an odd prime, then either G is commu¬ 
tative or G contains a normal subgroup of order p. 
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Joseph Louis Lagrange (1736-1813) 
was born on January 25, 1736, in Turin, 

Italy. He spent the early part of his life in 
Turin. While there he was involved in car¬ 
rying out research work in calculus of varia¬ 
tions and mechanics. 

In 1766, Lagrange was invited by the 
Prussian king, Frederick II, to fill the po¬ 
sition vacated by Euler in Berlin. Freder¬ 
ick the Great proclaimed in his appointment 
that “the greatest king in Europe” ought 
to have “the greatest mathematician in Eu¬ 
rope.” In 1787, after the death of Frederick 
II, he went to Paris, accepting an invitation 
from Louis XVI. In 1797, he accepted a po¬ 
sition at the newly formed Ecole Polytechnique in Paris. He was made a count by 
Napoleon and remained at the Ecole Polytechnique till his death. He died on April 
10, 1813. 

Throughout his life, Lagrange did work of fundamental importance. He made 
numerous contributions to many branches of mathematics, including number theory, 
the theory of equations, differential equations, celestial mechanics, and fluid mechanics. 
In 1770, he proved the famous Lagrange’s theorem in group theory. 

He is responsible for the work leading to Galois theory. In his paper, “Reflexion sur 
la theorie algebriques des equations,” Lagrange carefully analyzed the various known 
methods to solve a polynomial equation of degree < 4 by means of radicals. He was 
interested in finding a general method of solution for polynomials of higher degree. 
He was unable to find a general solution, but in his paper he introduced several key 
ideas on the permutations of roots which finally led Abel and Galois to develop the 
necessary theory to answer the question. Lagrange’s work on the solution of polynomial 
equations is one of the sources from which modern group theory evolved. 






Chapter 5 

Homomorphisms and 
Isomorphisms of Groups 


One of the main uses of the concept of an isomorphism is the classification 
of algebraic structures—in particular, groups. Readers with some knowledge 
of linear algebra may recall that the concept of an isomorphism is used to 
completely characterize vector spaces with the same field of scalars in terms 
of a single integer, the dimension of the vector space. Another important use 
of an isomorphism is the representation of one algebraic structure by means 
of another. This is done in linear algebra, where it is shown that the vector 
space of all linear transformations from one finite dimensional vector space into 
another is isomorphic to a certain vector space of matrices. 

5.1 Homomorphisms of Groups 

In this section, we consider certain mappings between groups. These mappings 
will be defined in such a way as to preserve the algebraic structure of the groups 
involved. More precisely, suppose we are given a function / from a group G into 
a group Gi, where *1 denotes the operation of G\ . Let a, b € G. Then under 
/, a corresponds to /(a), b to /(&), and a * b to /(a * b). If / is to preserve 
the operations of G and G\ , a * b must correspond to /(a) *1 f(b). Since / is a 
function, this forces the requirement that f(a*b ) = /(a) *1 /(6). 

Definition 5.1.1 Let ((?,*) and (C?i,*i) be groups and f a function from, G 
into G\. Then f is called a homomorphism of G into G\ if for all a,b 6 G, 

f(a*b) = f(a) *! /(&). 

Let the identity element of the group G\ be denoted by ei. 

Define / : G —> G\ by /(a) = e\ for all a € G. Since f(a * b) = e\ = 
ei ei = f{a) *i f(b) for all a, b E G, we find that / is a homomorphism from 
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G into G i. This shows that there always exists a homomorphism from a group 
G into a group G\. This homomorphism is called the trivial homomorphism. 

The identity map from G onto G is also a homomorphism. 

Before we consider more examples of homomorphisms, let us prove some 
basic properties of homomorphisms. 

Theorem 5.1.2 Let f be a hom.om,orphism of a group G into a, group G\. 

Then 

(i) f(e) = ei. 

(ii) /(a -1 ) = /(a) -1 for all a E G. 

(in) If H is a subgroup of G , then f(H) = {/(/i) \ h E if} is a subgroup of 

G\. 

(iv) If Hi is a subgroup of G\, then f~ l {H\) = {g E G \ f(g) E H\} is 
a subgroup of G, and if Hi is a norm,a,l subgroup, then / _ 1 (ifi) is a normal 
subgroup of G. 

(v) If G is commutative, then f(G) is commutative. 

(vi) If a E G is such that o(a) = n, then o(/(a)) divides n. 

Proof. (i) Since / is a homomorphism, /(e)/(e) = /(ee) = /(e) = /(e)ei. 

This implies that /(e) = ei by the cancellation law. 

(ii) Let a E G. Then /(a)/(a -1 ) = f{aa~ l ) = /(e) = e\. Similarly, 
/(a^ 1 )/(a) = ei. Since /(a) has a unique inverse, /(a -1 ) — /(a) -1 . 

(iii) Let if be a subgroup of G. Then e E if and by (i), /(e) = e\. Thus, 
ei = /(e) E /(if) and so /(if) ^ <f>. Let f(a),f(b) E f{H), where a,b £ H. 

Since ii is a subgroup, ab~ l E H. Thus, f(a)f(b)~ l = f^f^b -1 ) = f(ab _1 ) E 
/(if). Hence, by Theorem 4.1.3, /(if) is a subgroup of G\. 

(iv) By (i), e E / _ 1 (ifi) and so / - 1 (ifi) ^ <f. Let a, b E / - 1 (ifi). Then /(a), 
f(b) E Hi. Hence, /(at* 1 ) = /(a)/(6' 1 ) = f(a)f{b)~ 1 E ifi and so ab~ l E 
/ _ 1 (ifi). Thus, by Theorem 4.1.3, / - 1 (ifi) is a subgroup of G. Suppose Hi 
is a normal subgroup of G i- Let g E G. We now show that gf~ l (Hi)g~ l C 
/ - 1 (ifi). Let a E gf~ 1 (Hi)g~ 1 . Then a = gbg~ l for some b E / - 1 (ifi). Now 
/W = /(p&fiT 1 ) = /(p)/( fo )/(y~ 1 ) = f{g)f(b)f{g)~ 1 E ifi since ifi is a 
normal subgroup of G/ and /(&) E ifi- Hence, a E /~ L (ifi) and this shows 
that <?/ - 1 (ifi )<? -1 Q / _ 1 (ifi). Thus, / - 1 (ifi) is a normal subgroup of G. 

(v) Suppose G is commutative. Let /(a), f(b) E f{G). Then /(a)/(6) = 

/(a&) = f{ba) = f(b)f(a). Hence, f(G) is commutative. 

(vi) Since (/(a)) n = /(a 71 ) = /(e) = ei, we have o(/(a)) divides n by 
Theorem 2.1.28. ■ 

Definition 5.1.3 Let f be a hom,omorphism, of a group G into a group G/. 

The kernel of /, written Ker f, is defined to be the set 

Ker / = {a E G | f(a) = ei}. 
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By Theorem 5.1.2, e G Ker /. 

Example 5.1.4 Define the function f from. (Z, +) into (Z n , + n ) by /(a) = [a] 
for all a G Z. From, the definition of /, it follows that f ma,ps Z onto Z n . Let 
a, 6 G Z. XTien 

/(a + 6 ) = [a + 6] = [a] [b] = f(a) + n f(b). 

Thus, f is a hom,om,orphism, of Z onto Z n . /Void 

iTer f = {a G Z | /(a) = [0]} 

= {a G Z j [a] = [0]} 

= {aez \ a is divisible by n } 

= {a G Z | a = qn for some q G Z} 

= {p I 9 e Z}. 

The above example shows that a nontrivial finite group may be an image 
of an infinite group under a homomorphism. By Theorem 5.1.2(v), a noncom- 
mutative group cannot be an image under a homomorphism of a commutative 
group. In the next example, we show that two finite groups G and G\ having 
same number of elements need not have a homomorphism from G onto G\. 

Example 5.1.5 The groups Z 4 x Z 4 and Z$ x Z 2 are comm,utative and each is 
of order 16. Suppose there exists a hom,om,orphism, f 0 /Z 4 X Z 4 onto Zs x Z 2 . 
Now a = ([7], [0]) G Z% x Z 2 and o (a) =8. Since f is onto Z% x Z 2 , there exists 
b G Z 4 x Z 4 such that f(b ) = a. By Theorem 5.1.2 (vi), °(f(b)) divides o (b). 
Since o(/(6)) = 8 and Z 4 x Z 4 has elements of order 1, 2, and 4 only, o(/(6)) 
cannot divide o(6). This is a contradiction. Hence, there does not exist any 
homom.orphism, from, Z 4 x Z 4 onto Zg x Z 2 . 

Definition 5.1.6 Let G and G\ be groups. A hom,omorphism. f : G ^ G 1 is 
called, an epimorphism if f is onto G\ and f is called a, Tnonom,orphism if 
f is one-one. If there is an epimorphism, f from, G onto G 1 , then G 1 is called 

a homomorphic image of G. 

The homomorphism in Example 5.1.4 is an epimorphism, but not a monomor¬ 
phism. 


Example 5.1.7 Let R* be the group of all nonzero real numbers under mul¬ 
tiplication. Define f : R* —» R* by f(a) = |a|. Now f(ab) = \ab\ = |a| | 6 | = 
/(a)/( 6 ), which implies that f is a hom,om,orphism.. Since /( 1 ) = 1 = /(— 1 ) 
and 1 7 ^ — 1 , / is not one-one. Also, from the definition of f, it follows that f 
is not onto R*. Hence, f is neither an epim.orphism nor a monomorphism,. 

The following theorem gives a necessary and sufficient condition for a ho¬ 
momorphism to be a one-one mapping in terms of its kernel. 


Theorem 5.1.8 Let f be a hom,om,orphism, of a group G into a group G 1 . 
Then f is one-one if and only if Ker f = {e}. 
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Proof. Suppose / is one-one. Let a E Ker /. Then /(a) = e\ = /(e) by 
Theorem 5.1.2(i). Since / is one-one, we must have a — e. Hence, Ker / — {e}. 
Conversely, suppose that Ker / = {e}. Let a, 6 E G. Suppose f(a ) = f(b). 
Then 

/(a 6 ~ 1 ) = f(a)f(b~ l ) = f(a)f(b)~ l = ei. 

Thus, a 6 _1 E Ker / = {e} and so ab _1 = e, i.e., a = b. This proves that / is 
one-one. ■ 

Theorem 5.1.9 Let f be a hom,omorphism of a group G into a group G\. 
Then Ker f is a normal subgroup of G. 


Proof. Since e E Ker /, Ker f ^ <f>. Let a, b E Ker f. Then f(ab 1 ) = 
f(a)f(b~ 1 ) = f(a)f(b)~ 1 = ei(ei ) -1 = e\e\ — e 1 . Thus, a 6 _1 E Ker / and 
hence Ker / is a subgroup of G by Theorem 4.1.3. Let a E G and h E Ker 
/. Then /(a/m -1 ) = /(a)/(h)/(a“ 1 ) = f(a)f(h)f(a)~ 1 = f(a)eif(a)~ l = e 1 . 
Therefore, a/ia _1 E Ker /. This proves that aKer /a -1 C Ker f. Hence, Ker / 
is a normal subgroup of G by Theorem 4.4.3. ■ 


Example 5.1.10 Let GL{ 2,R) = 


a b 
c d 


a, b, c, d E R, ad — be ^ 0 


the noncommutative group of Example 2.1.10. Let R* be the group of all nonzero 
real numbers under multiplication. Define f : GL{2 , R) —» R* by 


f ( e d )= ad ~ bc 


for all 


a b 
c d 


E (?L(2, R). Let 


a b 
c d 


u v 


w s 


E GL{ 2,R). Now 



au -f bw av + bs 
cu + dw cv -1- ds 
(i au + bw) (cv + ds) — (av -f bs) (cu + dw) 
(ad — 6 c) (-us — vw) 





This proves that f is a hom,om,orphism„ To show that f is onto R*, let a E RC 


Then 


a 0 


0 1 


E GL( 2, R) and f 


a 0 
0 1 


= a. Hence, f is onto R*. Since 


a 0 
0 1 


= a = f 


a 1 
0 1 


a 0 
0 1 


a 1 


0 1 


, / is not one- 
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The previous example shows that there may exist a homomorphism of a 
noncommutative group onto a commutative group. 

Example 5.1.11 Consider S3 and the normal subgroup 

HOThT’OT}- 

Define f : £3 — > S3/H by for all -k £ S3, /( tt) = 7r H. Then 

f(ir o tt') — (tt o 7 t')H ~ (ttH) o (tt'H) — f( tt) o f( tt') 

for all 7T, tt' 6 S3. Hence, f is a, hom,om,orphism. Also, Ker f = {a £ S3 | 
aH = H} = {a £ S 3 \ a 6 H} = H. 

In Theorem 5.1.9, we showed that if / is a homomorphism of a group into a 
group G\, then Ker / is a normal subgroup of G. In the following theorem, we 
show that every normal subgroup if of a group induces a homomorphism g of 
G onto the quotient group GJH such that Ker g = H. We note that in Example 
5.1.11, the conclusion did not depend on the nature of S3. The conclusion was 
made by use of general arguments. This also leads us to the following theorem. 

Theorem 5.1.12 Let H be a normal subgroup of a group G. Define the func¬ 
tion g from. G onto the quotient group G/H by g(a) — aH for all a € G. Then 
g is a homom,orphism. of G onto G/H and Ker g = H. (The hom.om.orphism, g 
is called the natural homomorphism of G onto G/H.) 


Proof. From the definition of g , it follows that g is a function from G onto 
G/H. To show g is a homomorphism, let a, b € G. Then g(ab) — (ab)H = 
(aH)(bH) — g(a)g(b). Hence, g is a homomorphism of G onto G/H. Finally, 
we show that Ker g — H. Now a € Ker g if and only if g(a) = eH if and only 
if aH — eH if and only if e~ l a £ H if and only if a (E H. Thus, Ker g = H. ■ 


We now define a particular type of homomorphism between groups in order 
to introduce the important idea of groups being algebraically indistinguishable. 

Definition 5.1.13 A h om,om,orphism, f of a. group G into a group G\ is called 
an isomorphism of G onto G 1 if f is one-one and. onto G\. In this case, we 
write G — G\ and. say that G and G\ are isomorphic. An isomorphism of a. 
group G onto G is called an automorphism. 


For a group G , Aut(G), denotes the set of all automorphisms of G. 

In the following theorem, we collect some properties of isomorphisms, which 
will be useful in determining whether given groups are isomorphic or not. 
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Theorem 5.1.14 Let f be an isomorphism, of a group G onto a group G 
Then 

(i) f~ l : G\ —> G is a.n isom.orph.ism.. 

(n) G is commutative if and only if G\ is commutative. 

(Hi) For all a G (7, o(a) = o(/(a)). 

(iv) G is a. torsion group if and. only if G i is a, torsion group. 

(v) G is cyclic if and only if G\ is cyclic. 


Proof. (i) Since / is one-one and onto Gf~ 1 is one-one and onto G. Now 
we only need to verify that f~ l is a homomorphism. Let u, v G G\. Then there 
exist a ,6 6 G such that f{a) = u and f(b) = v. This implies that a — f~ 1 (u ), 
6 = / _ 1 (u), and uv = f(a)f(b) = f(ab). Thus, = ab = / —1 (xi)/ — x (v) 

and so / _1 is a homomorphism. Hence, f~ l is an isomorphism. 

(ii) Suppose G is commutative. Let u,v G G\. Since / is onto G\, there 
exist a, b G G such that /(a) = u and f(b) — v. Now uv — /(a)/( 6 ) = /(a 6 ) ~ 
/(6a) — /( 6 )/(a) = vu. Thus, G\ is commutative. Conversely, suppose G\ is 
commutative. Let a, 6 G G. Now f(ab) — f(a)f(b) — f(b)f(a) = f(ba). Since 
f is one-one, we have ab = ba. This proves that G is commutative. 

(iii) Let a G G. By induction, it follows that for all positive integers n, 
f(a n ) — (/(a)) n . Since / is one-one, for all 6 G G, f(b) — e\ if and only if 
b — e. Hence, a n = e if and only if (/(a)) n = e\. Thus, a is of finite order if and 
only if f{a) is of finite order. Suppose o(a) — m and o(/(a)) = n. Since a m = e, 
(/(a )) 771 = ej. By Theorem 2.1.28, n divides m. Also, (/(a )) 71 = ex implies that 
a n — e. Hence, m divides n. Since m and n are both positive integers and m, 
divides n and n divides m, it follows that m. — n. 

(iv) This follows immediately by (iii). 

(v) Suppose G is cyclic. Then G = (a) for some a G G. Since /(a) G G i, 
(/(a)) Q G\. Let 6 G G\. Since / is onto G i, there exists c G G such 
that /(c) = 6. Now c — a n for some n G Z. Thus, 6 = /(c) — /(a n ) = 
(/(a)) n G (/(a)) . Hence, Gh = (/(a)) and so Gi is cyclic. The converse follows 
since / -1 is an isomorphism. ■ 


In order to develop a feel for two groups being algebraically indistinguish¬ 
able, let us consider two sets S and S' such that there is a one-one function / of 
S onto S'. Then in a set-theoretic sense, S and S' are the same sets “under /”. 
For instance, let A and B be subsets of S. Then f(A) and f(B) are correspond¬ 
ing subsets of S'. Now f(A fl B) — f{A) fi f(B) and f(A U B) — f(A) U f(B ); 
that is, union and intersection are preserved under /. Other purely set-theoretic 
operations can be seen to be preserved under / also. Now suppose binary oper¬ 
ations * and *' are defined on S and S' , respectively, so that ( S , *) and (S', *') 
are groups. Now even though S and S' are the same sets “under /,” they need 
not be the same as groups, i.e., / may not preserve operations. We have seen 
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that the requirement for / to preserve operations is that f(a*b ) = /(a) *' f(b) 
for all a, b £ S. 

We now consider examples of groups that are isomorphic and examples of 
groups that are not isomorphic. 

Example 5.1.15 Let n be a positive integer. Define f from Z n into Z/ ( n) 
by for all [a] £ Z n , /([a]) = a + (n) . Then [a] = [b] 7 / and only if n|(a — b ) 
if and only if a — b = nq for some g € Z if and only if a — b £ (n) if and 
only if a + (n) - b + (n) if and only if /([a]) = /([&]). Therefore, we find 
that f is a one-one function. From the definition ofit follows that f mnps 
Z n onto Z / (n) . Now /([a] + n [b]) = /([a + 6]) — (a + 6) + (n) = (a + (n))+ 
(6+ (n)) — /([a]) + /([&]). Thus, f is an isomorphism, of Z n onto Zj in) . 

Example 5.1.16 Consider the sets G = {e,a, 6 , c} a 7 ?,d = {1, —1, i, —i}. 

Define * and ■ on G and G \, respectively, by moans of the following operation 
tables. 


* 

e a b c 

e 

e a b c 

a 

a e c b 

b 

b c e a 

c 

c b a e 



Now G\ is a cyclic group generated by i. G is also a group. However, since 
aa — e, bb = e, and cc = e, no element of G has order 4 and so G is not cyclic. 
Thus, G and G\ are not isomorphic. 


Example 5.1.17 Let (R,+) be the group of real numbers under addition and 
(R + ,-) be the group of positive real numbers under multiplication. Define f : 
R —► R + by f(a) — e a for all a E R. Clearly f is well 'defined,. Let a, b e R. 
Then f(a+b) = e aJrb = e a e b = f(a)f(b). Hence, f is a homomorphism.. Suppose 
f(a) = fib). Then e a — e b and so log e e a = log e e b . This implies that a = b, 
whence f is one-one. Let b £ R + . Then log e b £ R and /(log e 6 ) = e ]ogeb = b. 
Thus, f is onto R + . Consequently, f is a.n isomorphism. o/(R, +) onto (R + , •). 


Example 5.1.18 Consider the groups (Z,+) and (Q,+). By Worked-Out Ex¬ 
ercise 1 (page 113), (Q,+) is not cyclic. Since (Z,+) is cyclic and. (Q, -h) is 
not cyclic, (Z,+) is not isomorphic to (Q,+) by Theorem, 5.1.14(t;). 
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Example 5.1.19 The group (Q,+) is not isomorphic to (Q*,-) since every 
nonidentity element of (Q, -j-) is of infinite order while —1 is a nonidentity 
element of (Q*, •) which is of finite order. 

Let us now characterize finite and infinite cyclic groups. 

Theorem 5.1.20 Every finite cyclic group of order n is isomorphic to (Z n , + n ) 
and every infinite cyclic group is isom,orphic to (Z,+). 

Proof. Let ((a) , *) be a cyclic group of order n. Let G = (a) . Define 
the function / : G —► Z n by for all a 1 E G, f{a l ) = [i]. Now a 1 = a- 7 if and 
only if a 7- * — e if and only if n\(j — i) if and only if [i] = [j] (Exercise 11, 
page 30) if and only if f(a l ) = /(a 7 ). Thus, / is a one-one function. Now 
f(a l a J ) = f(a l+J ) = [i + j] = [i] + n [j] = f(a l ) + n /(a 7 ). Since f is one-one 
and G and Z n are finite with same number of elements, / is onto Z n . Hence, 
G~Z n . 

Now let G = (a) be an infinite cyclic group. Define the function / : G —> Z 
by f(a}) = i for all i E Z. Since a 1 = a - 7 if and only if a z ~ 7 = e if and only 
if i — j = 0 (since a is of infinite order) if and only if % = j, we have that / 
is a one-one function of G into Z. From the definition of /, / is onto Z. Now 
/(aV) = /(a*+ 7 ) = i + j = f(a*) + /(a 7 ). Hence, G ~ Z. ■ 

Corollary 5.1.21 Any two cyclic groups of the same order are isom,orphic. ■ 

From the above corollary, it follows that there is only one (up to isomor¬ 
phism) cyclic group having a prescribed order. 

In Example 5.1.16, we saw that there are at least two nonisomorphic groups 
of order 4. We now show that these are exactly two nonisomorphic groups of 
order 4. 

Let G be a group of order 4 which is not cyclic. (Example 5.1.16 shows 
that such a group exists.) Then no element of G can have order 4, for if a E G 
has order 4, then e, a, a 2 , a 3 would be distinct elements of G and thus G would 
be cyclic, i.e., G = (a) . This is contrary to the assumption that G is not cyclic. 
Let G = {e,a,5,c}. Since the order of every element of G divides the order 
of G , a, 6 , and c have order 2. If ab = a, then b = e, a contradiction. Thus, 
ab 7 ^ a. Similarly, ab ^ b. Suppose ab = e, then a(ab) = ae. Therefore, b = a 
since a 1 = e, a contradiction. Thus, ab — c. Similarly, ba = c. Hence, ab = ba. 
By similar arguments, we have ac — b — ca and be — a = cb. Thus, we find 
that G is a commutative group and its operation table is given by the table in 
Example 5.1.16. Consequently, there is essentially one group of order 4 which 
is not cyclic. This is the Klein 4-group. Since all cyclic groups of the same 
orders are isomorphic, we thus have exactly two nonisomorphic groups of order 
4, namely, the Klein 4-group and the cyclic group of order 4. We have thus 
proved the following result. 
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Theorem 5.1.22 There are only two groups of order 4 (up to isomorphism.), 
a cyclic group of order 4 and K 4 (Klein 4-group). 

Since every cyclic group is commutative and every group of prime order is 
cyclic, it follows that that if a group is noncommutative, then it must have order 
at least 6 . Indeed, the symmetric group S 3 is noncommutative and of order 6 . 
Since all cyclic groups of the same order are isomorphic and since every group 
of prime order is cyclic, there is exactly one group of order 1 , 2, 3, 5 (up to 
isomorphism), respectively. We have seen that there are two nonisomorphic 
groups of order 4. In the next theorem, we show that there are only two (up 
to isomorphism) nonisomorphic groups of order 6 . 

Theorem 5.1.23 There are only two (up to isomorphism.) groups of order 6. 


Proof. The group Zq is a cyclic group of order 6 and S 3 is a noncommutative 
group of order 6 . Note that Zq is not isomorphic to S3. To show that there are 
only two (up to isomorphism) nonisomorphic groups of order 6 , we will show 
that any group of order 6 is isomorphic to either Z 6 or S3. 

Let G be a group of order 6. Since |( 7 | is even, there exists a € G, a ^ e 
such that a 2 = e. If x 2 = e for all x E G, then G is commutative and for any 
two distinct nonidentity elements a and b , {e,a, b , ab} is a subgroup of G. Since 
|(jr| = 6, G has no subgroups of order 4 . Hence, there exists b E G such that 
b 2 ^ e, i.e., b e and 0(6) ^ 2 . Since o( 5 )| 6 , 0(6) = 6 or 3 . If 0(6) = 6, then 
G = (b) is a cyclic group of order 6 and G ~ Zq. Suppose G is not cyclic. Then 
0(6) = 3 . Let H = {e, b, b 2 }. Then H is a subgroup of G of index 2 . Thus, H 
is a normal subgroup of G. Clearly a ^ H. Now G = H U aH and H D aH = <f. 
Hence, G = {e, b , b 2 , a , ab, ab 2 }. Now aba~ l E H since H is normal and 6 E H. 
Therefore, aba~ l = e or aba _1 — 5 or aba~ l ~ b 2 . If aba~ l = e, then b = e, 
which is a contradiction. If aba~ l = 6, then ab — ba. Since o(a) and 0(6) are 
relatively prime and ab = ba, o [ab) = o(a) • 0(6) = 6. Thus, G is cyclic, a 
contradiction. Hence, aba~ l = b 2 . Thus, G = (a, 6) , where o(a) = 2 , 0(6) = 3 , 
and aba -1 = b 2 . It is now easy to see that G ~ £3. ■ 

We conclude this section by proving Cayley’s theorem, which says that any 
group can be realized as a permutation group. 

Let a be an element of a group G. Define the function f a : G —> G by 
for all b E G, f a (b ) = ab. Then b — c if and only if ab = ac if and only if 
f a {b) — fa{c). Thus, f a is a one-one function of G into G. For any b E G, 
^(a^ 1 ^) = a(a~ 1 b) = b. So we find that f a maps G onto G. Hence, f a is a 
permutation of G. Let F(G) — {f a \ a G G }. Then F(G) is a subset of the set 
S{G) of all permutations on G. Recall that ( S(G ), o) is a group. 

As previously mentioned,pearly mathematicians worked only with groups 
of permutations. The following theorem says that every group is isomorphic 
to a group of permutations of its own elements. In fact, we will show that 
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(F(G), o) is a group which is isomorphic to G. First let us note that for all 
b E G, / 0 -i( 6 ) = a - 1 6 , and / a (a _ 1 6 ) = b implies (/ a ) -1 (b) = a~ 1 b. Thus, 

(/.r 1 = /.-*• 

Theorem 5.1.24 (Cayley) For any group G, (F(G), o) a, group and G — 
F{G). 

Proof. We first show that (F(G), o) is a group. It suffices to show that 
F(G) is a subgroup of ( 5 (G), o). Let f a ,fb £ F(G). Then (/ a o /.“^(c) = 
(fa° fb-i)(c) = fa(fb-i(c)) = /a(^ _ 1 c) = a( 6 _ 1 c) = (a 6 _1 )c = f ab -i(c) for all 
c E G and so f a o f ^ 1 = f ab -i E F(G). Hence, F(G) is a subgroup by Theorem 
4.1.3. Define g : G —> F(G) by for all a E G, g(a) = / a . Then a = 6 if and 
only if ac = be for all c E G if and only if / Q (c) = f b (c) for all c E G if and 
only if f a — fb if and only if g{a) = g( 6 ). This proves that g is a one-one 
function of G into F(G). Clearly g maps G onto F(G). Now g(ab) = f ab and 
g(a) o g(b) — f a o f b . Also, for all c E G, f ab {c) = (ab)c = a( 6 c) = / a (bc) = 
fa{fb(c )) = (/a O /b)(c). Thus, f ab = f a o f b . Hence, g(ab) = g(a) o g( 6 ) and so 
^ is an isomorphism. ■ 

Cayley’s theorem is another example of a representation theorem. However, 
Cayley realized that the best way of studying general problems in group theory 
was not necessarily by the use of permutations. 

5.1.1 Worked-Out Exercises 

<0 Exercise 1 Let / : G —* G\ be an epimorphism of groups. If if is a normal 
subgroup of G, then show that f{H ) is a normal subgroup of G\. 

Solution: By Theorem 5.1.2, we find that f(H ) is a subgroup of G\. Let 
9 \ ^ G\. Since / is onto Gi, there exists g E G such that f(g) = g\. Let 
a € gif{H)gf l = f{g)f(H)f(g)~ 1 . Then a = f(g)f(h)f(g )- 1 = f(ghg ~ l ) for 
some h E H. Since H is a normal subgroup of G, ghg~ l E H and so a E f{H). 
Thus, g\f(H)gf l C f{H). Hence, f(H) is a normal subgroup of G\. 

0 Exercise 2 Let G and H be finite groups such that gcd(|G| , \H\) = 1. Show 
that the trivial homomorphism is the only homomorphism from G into 
H. 

Solution: Let / : G —*■ H be a homomorphism and let a E G. We show that 
every element of G is mapped onto the identity element of iL, i.e., f(a ) — e# 
for all a E G, where en denotes the identity element of H. Now o(a)| |G| and 
°(/(a))| \H\. Also, by Theorem 5 . 1 . 2 , o(/(a))| o-(a). Hence, o(/(a))| |G|. Since 
|G| and \H\ are relatively prime, o(/(a)) = 1, proving /(a) = e#. Thus, / is 
the trivial homomorphism. 
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0 Exercise 3 Show that the group (Q,+) is not isomorphic to (Q/Z,+). 

Solution: In (Q,+), every nonzero element is of infinite order. Let ^ + 
Z £ Q/Z, where p, g £ Z and q 7 ^ 0. Then q(^ + Z) = p + Z = Z. This shows 
that every element of Q/Z is of finite order. Hence, (Q, +) is not isomorphic 
to (Q/Z, +). 

Exercise 4 Show that R*, the group of all nonzero real numbers under mul¬ 
tiplication, is not isomorphic to C*, the group of all nonzero complex 
numbers under multiplication. 

Solution: In the group C*, i is an element of order 4. But R* does n~ + 
contain any element of order 4. Hence, by Theorem 5.1.14, R* is not isomorphic 
to C*. 


0 Exercise 5 Find all homomorphisms from Zq into Z 4 . 

Solution: Zg = ([!]) - Let / : Zq —> Z 4 be a homomorphism. For any [a] £ 
Zq, /([a]) = a/([l]) shows that / is completely known if /([l]) is known. Now 
°(/([l])) divides o([l]) and 4, i.e., °(/([l])) divides 6 and 4. Hence, o(/([l])) — 1 
or 2. Thus, /([l]) = [0] or [2], If /([l]) — [0], then / is the trivial homomorphism 
which maps every element to [0]. On the other hand, /([l]) = [ 2 ] implies that 
/([a]) = [ 2 a] for all [a] £ Z 6 . Thus, /([a] + [ 6 ]) — /([a + 6 ]) = [2(a + 6 )] = 
[2a + 26] = [ 2 a] + [26] = /([a]) + /([6]), proving that the mapping f : Zg —> Z 4 
defined by /([a]) — [2a] for all [a] £ Zq is a homomorphism. Hence, there are 
two homomorphisms from Zq into Z 4 . 

Exercise 6 Let O' be a finite commutative group. Let n £ Z be such that n 
and |(j| are relatively prime. Show that the function 0 : G —» G defined 
by 0(a) = a n for all a £ G is an isomorphism of G onto G. 


Solution: Let a, 6 £ G. Now 

4 >{ab) — ( ab) n 

— a n b n (since G is commutative) 

= 0M0(&)- 

This implies that 0 is a homomorphism. Let 0(a) = 0(6). Then a n = b n and so 
(a 6 _1 ) n = e. Therefore, o(a 6 -1 ) divides n. Since o(a 6 -1 ) divides \G\ and n and 
|(?| are relatively prime, o(a 6 ~ 1 ) = 1. This implies that a 6~ 1 = e, i.e., a = 6 , 
proving that 0 is one-one. Since G is a finite group and 0 is one-one, 0 is onto 
G. Hence, 0 is an isomorphism of G onto G. 
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Exercise 7 (i) Let G be a group and / : G —» G be defined by /(a) = a n for 
all a £ G, where n is a positive integer. Suppose f is an isomorphism. 
Prove that a n_1 € Z{G ) for all a £ G. 

(ii) Let G be a group and f : G G defined by for all a £ G, /(a) — a 3 
be an isomorphism. Prove that G is commutative. 

Solution: (i) Let a,b £ G. Then f(a~ l ba ) = ( a~ 1 ba) n — a _ 1 6 n a. Thus, 

a~ n b n a n = f(a~ l )f(b)f(a) = f(cT l ba ) = a _ 1 6 n a. 

Hence, a”^' 1 ) b n a ™" 1 - b n or (a-fa^fra "" 1 ) 71 - 6 n . Thus, - 

/(5). Since / is one-one, a - ( n- 1 ) 6 a n_1 — 6 . Hence, a n_1 6 = 6 a n_1 , proving that 

n " 1 E Z(G). 

(ii) By (i), a 2 6 Z(G) for all a £ G. Let a,b £ G. Then f(ab) = ( ab) 3 = 
ab(ab) 2 = a(ab) 2 b — aababb — a 2 bab 2 = ba 2 b 2 a = bb 2 a 2 a = 6 3 a 3 = f(b)f(a ) = 
/( 6 a). Hence, a 6 — 5a since / is one-one. Thus, (7 is commutative. 

5.1.2 Exercises 

1. Determine whether the indicated function f is a homomorphism from the 
first group into the second group. If / is a homomorphism, determine its 
kernel. 

(i) /(a) = a 2 ; (R+, ■), (R + , •) for all a £ R + . 

(ii) /(a) — 2 a ; (R, +), (R + , •) for all a £ R. 

(hi) f(a) = |a|; (R\{0}, ■), (R + , •) for all a € R\{0}. 

(iv) f (a) — a T 1; (Z,+), (Z, +) for all a £ Z. 

(v) /(a) = 2a; (Z,+), (Z,+) for all a 6 Z. 

2. Find all homomorphisms from Z into Z. How many homomorphisms are 
onto? 

3. Find all homomorphisms from Z onto Z§. 

4. Find all homomorphisms from Zg into Z 12 and from Z 20 into Z\q. 

5. Show that Q*, the group of all nonzero rational numbers under multipli¬ 
cation, is not isomorphic to R*, the group of all nonzero real numbers 
under multiplication. 

6 . Show that (Q,+) is not isomorphic to (R,+). 

7. Show that (Z,+) is not isomorphic to (R,+). 

8 . Let G be a group. Define the function / : G —► G by for all a £ G , /(a) = 
a -1 . Prove that / is a homomorphism if and only if G is commutative. 
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9. Let G = {(a, 6 ) | a, 6 € R, 6 7 ^ 0}. Then (G, *) is a noncommutative group 
under the binary operation (a, 6 ) * (c, d) — (a + 6 c, bd) for all (a, 6 ), (c, d) € 
G. Let H = {(a, 6 ) 6 G \ a — 0} and K = {(a, 6 ) E G | 6 > 0}. Show that 
77n77 ~ (R + , 0 . where (R + , •) is the group of all positive real numbers 
under multiplication. 

10. Let G = {a £ R | — 1 < a < 1}. Show that (G, *) ~ (R,+), where the 
binary operation * on G is defined by 

a + b 

a * b — --- 

l+a 6 

for all a, 6 € G. 

11. (i) Let / be a homomorphism from a cyclic group of order 8 onto a cyclic 
group of order 4. Determine Ker /. 

(ii) Let / be a homomorphism from a cyclic group of order 8 onto a cyclic 
group of order 2. Determine Ker /. 

12. Prove that a homomorphic image of a cyclic group is cyclic. 

13. Show that S 3 and Z 6 are not isomorphic groups, but for every proper 
subgroup A of S 3 there exists a proper subgroup B of Z 6 such that 
A ~ B. 


14. Let G, 77, and K be groups. Suppose that the functions / : G —» H and 
g : H —> K are homomorphisms. Prove that g o f : G —> K is also a 
homomorphism. 

15. Let G and H be groups. Define the function / : G x H —► G by for 
all (a, 6) £ G x H, f((a,b)) — a. Prove that / is a homomorphism from 
G x H onto G. Determine Ker /. 

16. Let / : G —» H be an isomorphism of groups. Prove that / _1 : H —> G 
is also an isomorphism of groups. 

17. Let G, 77, and 77 be groups. Prove that 

(i) G x H ~ 77 x G. 

(ii) If G ~ 77 and 77 ~ 77, then G ~ 77. 

(iii) G x (77 x 77) ~ (G x 77) x 77. 

18. Let G and 77 be groups. Let / : G —* H be a homomorphism of G onto 
77. Show that if G = (S') for some subset S of G, then 77 = (/(S)). 

19. Let / : G —> 77 be an isomorphism of groups. Show that for any integer 

/c and for any g G G, the sets ^4 = {a € G | = g] and B = {6 E 77 | 

— fid)} have the same number of elements. 
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20. Let G be a simple group and ip : S n G be an epimorphism for some 
positive integer n. Prove that G ~ Sk for some k <n. 

21. Which of the following statements are true? Justify. 

(i) A cyclic group with more than one element may be a homomorphic 
image of a noncyclic group. 

(ii) There does not exist a nontrivial homomorphism from a group G of 
order 5 into a group H of order 4. 

(iii) The group (Z,+) is isomorphic to (Q,+). 

(iv) There exists a monomorphism from a group of order 20 into a group 
of order 70. 

(v) There exists an epimorphism of (R,+) onto (Z,+). 

(vi) There does not exist any epimorphism of (Q,+) onto (Z,+). 

(vii) If / and g are two epimorphisms of a group G onto a group H such 
that Ker / = Ker g, then / = g. 

5.2 Isomorphism and Correspondence Theorems 

In this section, we continue our study of isomorphisms. Our objective is to 
prove the fundamental theorem of homomorphisms, the isomorphism theorems, 
and the correspondence theorem. These theorems show us the relationship 
between homomorphisms and quotient groups. 

Theorem 5.2.1 Let f be a homom,orphism. of a group G onto a group G\, H 
be a norm,al subgroup of G such that H C Ker /, and g be the natural hom.o- 
m,orphism of G onto G/H. Then there exists a unique hom,om,orphism h of 
G/H onto G i such that f = ho g. Furthermore, h is one-one if and only if 
H = Ker f. 



.Proof. Define h : G/H —> G\ by 

h(aH) = f(a) 


for all aH G G/H. 
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Now aH — bH implies b~ l a £ H C Ker / and so f(b~ l a) = e\ or f(a ) = 
f(b). Hence, h{aH) = h{bH) and so h is well defined. Let a £ G. Then 
(h o g)(a) = h(g(a)) = h{aH) — f(a). Therefore, h o g — f. Since / maps 
G onto G \, h must map G/iL onto Gp Now h((aH)(bH)) = h{fab)H ) = 
/(a6) = f(a)f(b) = h(aH)h(bH). Hence, h is a homomorphism of GjH onto 
G\ satisfying / = hog. To prove the uniqueness part, let us assume / = h'og for 
some homomorphism h! from G/H onto G\. Then h(aH) — f(a ) = (h/og)(a) = 
h'(g(a )) = h'{aH) for all aH £ G/H and so h = h!. Hence, h is the only 
homomorphism of G/H onto G\ such that f = ho g. 

Suppose h is one-one. Let a £ Ker /. Then /(a) = e\ and so h{aH) = ei. 
Since h{eH) = e\ and h is one-one, aH = eH. Thus, a £ H and so Ker / C H. 
By hypothesis, H C Ker f and so H = Ker /. Conversely, assume H = Ker 
/. Suppose h{aH) = h{bH). Then /(a) = f(b) or /(5 _1 a) = e\. Thus, b~ l a £ 
Ker f = H and so aH = bH, proving that h is one-one. ■ 

From Theorem 5.2.1, it follows that if H — Ker /, then h is an isomorphism 
and hence Gf Ker / is isomorphic to G i, i.e., every homomorphism of a group 
G onto a group G\ induces an isomorphism of Gj Ker / onto G\. This result 
plays a fundamental role in group theory. It is known as the fundamental 
theorem of homomorphisms for groups. This result is also called the first 
isomorphism theorem for groups. Considering the importance of this theorem, 
we state it in its general form and also give a direct proof of it. 

Theorem 5.2.2 (First Isomorphism Theorem) Let f be a homomorphism, 
of a group G into a group G\. Then f{G) is a subgroup of G\ and 

G/Ker f ~ f(G). 

Proof. By Theorem 5.1.2, f(G) is a subgroup of G\. Let H = Ker /. Define 
h:G/H - f(G) by 

h(aH) = f(a) 

for all aH £ G/H. Now aH = bH if and only if b~ 1 a £ H — Ker / if and only 
if f(b~ 1 a) = e\ if and only if /(6 _1 )/(a) = e\ if and only if f{a) = /(&). Thus, 
h is a one-one function. Let x £ f(G). Then x — f(b) for some b £ G. There¬ 
fore, h{bH) = f(b) = x. This shows that h is onto f(G). Finally, hfaHbH) = 
h{abH) = f(ab) = f(a)f(b) = h(aH)h(bH) for all aH,bH £ G/H, proving 
that h is a homomorphism. Consequently, G/Ker / ~ /(G). ■ 

In the following example we illustrate the first isomorphism theorem. 


Example 5.2.3 Let f be the hom,om.orphism, of (Z,+) onto (Z 3 ,+ 3 ) defined 



o 
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Z/ ( 6 ) . Now ( 6 ) is a normal subgroup ofZ and ( 6 ) C (3) = Ker f. Thus, there 
exists a hom.omorphi.sm, h of Z/ (6) onto Z 3 such that f — h o g. The homo¬ 
morphism h is defined by h(n + ( 6 )) = [n]. 


Z----Z 3 



3 + ( 6 ) 4 + ( 6 ) 5 + ( 6 ) 

Recall that a group G 1 is called a homomorphic image of a group G if 
there exists a homomorphism of G onto G\. 

From Theorem 5.2.1 and Corollary 5.2.2, we find that for each normal 
subgroup N of a group G, G/N is a homomorphic image of G, and for each 
homomorphic image G 1 , there exists a normal subgroup N of G such that 
G/N~Gi. 

Example 5.2.4 The group S 3 has (up to isomarphism.) only three horn,om,Or¬ 
phic images. This follows from, the fact that S 3 has only three normal subgroups. 
The horn,om,orphic images are S 3 , Z\, and Z 2 since {e}, S 3 , and {e, (1 2 3), (1 
3 2 )} are the only normal subgroups of S 3 and S 3 ~ 63 /{e}, Zi ~ S 3 /S 3 , and. 
Z 2 — S 3 /{e, (1 2 3), (1 3 2 )}. 

Theorem 5.2.5 Let G\ be a horn,om,orphic image of a, group G. Then the 
following assertions hold. 

(i) If G is cyclic, then G\ is cyclic. 

(ii) If G is commutative, then G 1 is commutative. 

(Hi) If G\ contains a,n element of order n and |G| is finite, then G contains 
an elem.ent of order n. 
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Proof. (i) Follows by Exercise 12 (page 152). 

(ii) Follows by Theorem 5.1.2(v). 

(iii) Let / : G — > G\ be an epimorphism and let a' be an element of G\ 
of order n. If n — 1, then e is the required element of G of order 1. Suppose 
n > 1. Since / is onto G\, there exists a e G such that /(a) — a'. Now o(a) 
is finite and by Theorem 5.1.2(v), o (a!) divides o(a), i.e., n divides o(a). Let 
t e Z + be such that o(a) = nt. Then t < o(a). Hence, a 1 ^ e. Now a nt — e. Let 
b = ah Then b n = e and by Theorem 2.1.28, 


gcd(t,o(a)) 


nt 

— = n.m 
t 


Note that the result in Theorem 5.2.5(iii) does not hold if |G| is not finite. 
For example, Zq is a homomorphic image of Z; Zq contains an element of order 
3, but Z has no element of order 3. 


Theorem 5.2.6 (Second Isomorphism Theorem) Let H and K be sub¬ 
groups of a group G with K normal in G. Then 


H/(HnK) ~ (. HK)/K. 


Proof. Define / : H -> (HK)/K by f{h) - hK for all h E H. Now 
f(hih 2 ) = h\h 2 K — h\Kh, 2 K ~ f{h\)f(h 2 ) for all hi, /12 6 H, proving that / 
is a homomorphism. Let xK 6 ( HK)/K. Then x — hk for some h E H and 
k € K. Thus, xK — ( hk)K = ( HK){kK ) == hK — f{h). This proves that / 
is onto ( HK)/K and so f(H) = ( HK)/K. Hence, by the first isomorphism 
theorem, it follows that 

H/Ker f ~ (. HK)/K. 

To complete the proof, we show that Ker f = H D K. Now 

Ker / = {he H \ f(h) = identity element of HK/K} 

= {he H\hK = K} 

= {heH I he K} 

- H n K. 

Consequently, H/H D K ~ ( HK)/K . ■ 

We illustrate the second isomorphism theorem with the help of the following 
example. 

Example 5.2.7 Consider the group (Z, +) and its subgroups H = (2) and 
K — ( 3). Then H + K — (2) + (3) — Z and H n K = (6) . Theorem, 5.2.6 says 
that 

H/(HnK) ~]H + K)/K, 
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i. e., 


(2) / (6) - Z/ (3). 


This isomorphism is evident if we notice that (2) / (6) = {0 + (6) , 2 + (6), 
4 + (6)} while Z/ ( 3 ) = {0 + ( 3 ) , 1 4 - ( 3 ) , 2 + ( 3 )}. The mopping 


h : (2) / (6) -* Z/ (3) 

defined by h : 0 + (6) —> 0 + (3) , 2 + (6) —► 2 + (6) , 4 + (6) —> 1 + (3) is the 
desired isomorphism.. 


Theorem 5.2.8 Let f be a homomorphism, of a group G onto a, group G \, H 
be a norm.al subgroup of G such that H D Ker /, and g, g' be the natural ho¬ 
momorphism,s of G onto G/H and G\ onto G\/f(H), respectively. Then there 
exists a unique isomorphism, h of G/H onto G\/f{H) such that g r o f — ho g. 


G- 


f 


9 

G/H 


h 


G 1 
9 ' 

Gi/fW) 


Proof. If we show Ker g' o f = H, then there exists a unique isomorphism 
h of G/H onto G\/f{H) by Theorem 5.2.1. Let a E H. Then (g' o f)(a) = 
g'(f(a}) — the identity of G\ff{H) since f{a) E f(H) = Ker g'. Thus, a E Ker 
g' of and hence H C Ker g' o f. Let a E Ker g' o f. Then g'(f(a)) = the identity 
of G\/f(H) and so f(a) E Ker g' = f(H). Therefore, there exists b E H such 
that f(b) = /(a) or f(ab~ l ) = e\. This implies that ab~ 1 E Ker / C H and so 
a = ( ab~ l )b E H. Thus, Ker g' o f C H. Hence, Ker g' o f = H. ■ 

Corollary 5.2.9 (Third Isomorphism Theorem) Let Hi, H 2 be normal 
subgroups of a group G such, that H\ C if 2 . Then 

(G/H 1 )/(H 2 /Hi)-G/H 2 . 


Proof. Make the following substitutions in Theorem 5.2.8: G/H\ for G 1, H 2 
for if, and {G/ H\)j{H 2 /H\) for G\jf(H), where in this case / is the natural 
homomorphism of G onto G/H\. Note that f(H 2 ) = H 2 /H\. 

G - - - -G/H 1 

G/H 2 - -~(G/ffi)/(ff 2 /j?i) ■ 
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We illustrate the third isomorphism theorem with the help of the following 
example. 

Example 5.2.10 Consider the group (Z,+) and the subgroups (6) and (3) of 
Z. Then 

z/ (3) = {0 + (3), 1 + (3), 2 + (3)}. 

Z/(6) = {0 + <6),l + <6),2+(6),3 + (6),4 + (6},5 + (6}}. 

(3) / (6) = {0 + (6), 3 + (6)}. 


Now, 

where 


(Z/{6»/((3>/(6)) = {0,1,2}, 

0 = 0 + (6> + «3) / <6» 

1 = 1 + (6) + «3) / (6)) 

2 = 2 + <6) + ((3)/(6». 


It is now clear that 


Z/ (3) ~ (Z/ <6»/«3) / (6)) 

since both are cyclic groups of order 3 and of course, by Corollary 5.2.9. 


We can at times determine the subgroups of a group G i from a group G 
whose subgroups are known if there is a homomorphism / of G onto G\. For 
if such an / exists, the following result says that the subgroups of G\ can be 
determined from the subgroups of G which contain Ker /. 

Theorem 5.2.11 (Correspondence Theorem) Let f be a hom,om.orph.ism, 
of a group G onto a. group G\. Then'f induces a one-one inclusion preserving 
correspondence between the subgroups of G containing Ker f and the subgroups 
of G i. In fact, if H and K are corresponding subgroups of G and G\, respec¬ 
tively, then H is a norm,al subgroup of G if and only if K is a normal subgroup 
of G^ 


Proof. Let 

Tt = {H | H is a subgroup of G such that Ker / C H} 


and 


K = {K | if is a subgroup of Gi}. 


Define /* : H K by for all H e Tt, f*(H) = {f(h ) | h G H}. Then 
f*(H) € K, by Theorem 5.1.2. Hence, f* is a function since f is a function. 
Let K G /C. Denote the preimage, f~ l (K), of K in G by H. Let a £ Ker /. 
Then /(a) — e\ € K and so a G f -1 (K) = H. Thus, Ker / C H. Let a, 
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b £ H. Then /(a), /(&) £ K and so f(ab 1 ) = f{a)f(b : ) = /(a)/(6) 1 G K. 
Therefore, a& -1 G H and so H is a subgroup of G containing Ker /, i.e., H G Pi. 
Hence, /* maps 7 i onto 1C. Let Hi, H 2 G Pi. Suppose f*(H\) = f*{H 2 )- Let 
h\ G Hi. Then there exists h 2 G H 2 such that /(hi) = f(h 2 ). This implies 
that f(hihf l ) = ei and so hihf 1 G Ker / C H 2 . Hence, hi = (hih^" 1 )h 2 G H 2 . 
Therefore, Hi C H 2 - Similarly, H 2 C Hi. Thus, Hi = H 2 and so /* is one-one. 
Clearly Hi C H 2 if and only if f*(Hi) C f*(H 2 ). In fact, since /* is one-one, 
Hi C H 2 if and only if f(ifi) C 

Suppose H is a normal subgroup of G such that Ker / C H. Let K — f*(H). 
We show that K is a normal subgroup of G. Let /(a) G G 1 and /(h) G K. 
Now aha,- 1 G H since H is a normal subgroup of G and so /(a)/(h)/(a) -1 = 
/(aha -1 ) G K. Hence, if is a normal subgroup of G\. Let J be a normal 
subgroup of Gi and L £ Pi be such that /*(L) = J. Let a £ G and h £ L. 
Then /(aha -1 ) = /(a)/(h)/(a) -1 G J and so aha -1 G L. This proves that L 
is a normal subgroup of G. ■ 

Corollary 5.2.12 Let N be a norm,al subgroup of a group G. Then every sub¬ 
group of G/N is of the form K/N , where K is a subgroup of G that contains N. 
Also, K/N is a norm,a,l subgroup of G/N if and only if K is a norm,al subgroup 
of G. 

Proof. Let g : G —» G/N be the natural homomorphism. If a £ G, then 
g{a) = aN. From Theorem 5.2.11, we find that this homomorphism induces 
a one-one mapping g* between the subgroups of G which contain Ker g = N 
and the subgroups of G/N. Let H be a subgroup of G/N. Then there exists a 
subgroup K of G such that N C K and H = g*(K ) = {^( a ) | a £ K] = K/N. 
The last part follows from Theorem 5.2.11. ■ 

The following example illustrates the correspondence theorem. 

Example 5.2.13 Let f be a hom.om.orphism, of (Z, +) onto (Zi2,+i2) defined 
by f(n) — [n] for all n £ Z. Then for Pi and K of Theorem, 5.2.11, 

W = {<12>,(6),<4),{3),(2>,Z} 

and, 


K = {([0]) 

,<[6]>,<[4]},([3]},<[2]> 

,z 12 }. 

/* : (12) - 

* ([0]} , 

/* : (3) - 

<[3]>, 

/* : (2) -> 

([2]), 

/* : (6) - 

<[6]), 

/* : (4) - 

([4]), 

/*: Z- 

Zi2- 


The following diagram, indicates the one-one inclusion preserving the cor¬ 
respondence property of f*. 


“Public. 
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7 Zio 



( 12 ) ([ 0 ]) 


Now ([9]) = {rx[9] | n e Z} C {n[3j | n £ Z} = ([3]). AJso, [3] — [27] = 
3[9] £ ([9]) . Therefore, {[3]) C {[9]) . Hence, {[3]) = {[9]) . Thus, the subgroup 
(9) of Z gets m,apped, to the subgroup ([3]) of Z 12 by f. However, this does not 
contradict Theorem. 5.2.11 since (9) 2 (12) • 

In the remainder of this section, we consider all isomorphisms of a group G 
onto itself. Recall that Aut(G) is the set of all automorphisms of G. 

Theorem 5.2.14 Let G be a group. Then (AutfG), o) is a group, where o 
denotes the composition of functions. 


Proof. Since iQ £ Aut(G), Aut(G) 7 ^ (j>. Let f,g£ Aut(G). Then fog is an 
automorphism by Exercise 14 (page 152) and Theorem 1.5.11. Hence, fog £ 
Aut(G). Clearly i G is the identity of Aut(G) and / _1 is the inverse of /. Also, 
o is associative by Theorem 1.5.13. Consequently, (Aut(G),o) is a group. ■ 

Theorem 5.2.15 Let G be a group and a £ G. Define 9 a : G —> G by 9 a (b ) = 
aba~ l for all b £ G. Then 

(i) 9 a € AutfG), 

(ii) 9 a o 9 h = 6 a b for all a,b £ G, 

(in) ( 0 a )~ l = 0 a - 1 , 

(iv) for all a £ Aut(G), a o 9 a o a~ l = Q a (a)- 


Proof. (i) Let c,d £ G. Suppose c = d. Then aca -1 = ada~ l or 9 a (c) = 
9 a {d). Therefore, 6 a is well defined. Now 9 a (cd) = a(cd)a~ l = (aca _1 )(oda _1 ) = 
9 a {c)9 a (d). This shows that 9 a is a homomorphism. Also, c = 9 a (a~ 1 ca ), prov¬ 
ing that 9 a is onto G. Suppose 9 a {c) — 9 a {d). Then oca -1 = ada~ 1 and so 
c~ d. Thus, 9 a is one-one. Consequently, 9 a £ Aut(C). 

(ii) Let a, 6 £ G. Then ( 9 a °9b)(c ) — 9 a (9b(c)) = 9 a (bcb~ 1 ) = a(bcb~ 1 )a ~ 1 = 
( ab)c(ab ) _1 = 9 a b(c ) for all c £ G. Hence, 9 a ° 9b = 9 a b . 

(iii) Note that 9 a o 9 a -1 = 9 aa -i = 9 e = i G and 9 a ~ 1 o 9 a = 0 a -i a = 9 e = i G . 
Thus, {9 a )~ l = 0 a - 1 . 

(iv) Let a £ Aut(G). Now (ao0 o oa! -:L )(&) = a(0 a (a _ 1 (6))) = a(aa _ 1 ( 6 )a~ 1 ) 


— a(a)o;(a 1 ( 6 ))o;(a x ) = a(a)b(a(a)) 1 = 0 a ( a )(6) for all b £ G. Hence, 
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ao 9 a o a 1 = O a ( a ). ■ 

The automorphism 9 a of Theorem 5.2.15 is called an inner automorphism 
of G. We denote by Inn((7) the set of all inner automorphisms of G. 

Theorem 5.2.16 Let G be a group. Then Inn(G ) is a, normal subgroup of 
Aut{G). 

Proof. Since {q — 9 e G Inn(G), Inn(G) / <fi. By Theorem 5.2.15(i), 
Inn (G) C Aut(G). Let 9 a ,9 b G Inn(G'). Then 9 a o 9f l — 9 a o 9 b -1 = 0 ab -i G 
Inn(G). Hence, Inn(G) is a subgroup of Aut(G) by Theorem 4.1.3. Let a G 
Aut(G). Then by Theorem 5.2.15(iv), a o 9 a o a -1 = # a ( a ) G Inn(G). Hence, 
Inn(G) is a normal subgroup of Aut(G). ■ 


Theorem 5.2.17 Let G be a group and H be a subgroup of G. Then 

~ a subgroup of Aut(H) : 

where N{H) — {x G G | xHx~ l — H} is the norm.alizer of H and. C(H ) = 
{x G G | xhx -1 — h for all h € H] is the centralizer of H. 



Proof. Define / : N{H) —>Aut(if) by for all a G N(H), 

f(a) = 9 a \ H . 

Then / is well defined. Let ai,a ,2 G N(H). Then /(a 1 CZ 2 ) — 9 aia2 \n — 0 ai \ h 0 
9 a2 \h — /( a i) 0 f( a 2 )- Thus, / is a homomorphism. Now 


{a 

GC 

f{°) 

= *h} 

{a 

G(? 

0a = 

Ih} 

{a 

eG 

0 a {b) 

= inib) for all b G H} 

{a 

gG 

aba~ 

1 - b for all b G H} 

{a 

G G 

ab = 

ba for all b G H} 

C(H). 




Thus, by the first isomorphism theorem, we have the desired result. ■ 
Corollary 5.2.18 Let G be a group. Then 

~ Inn(G). 



Proof. Let H = G in Theorem 5.2.17. Then we have N{G ) = G and 
C(G) = Z(G). ■ 
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5.2.1 Worked-Out Exercises 

0 Exercise 1 Find all homomorphic images of the additive group Z. 

Solution: Let H be a homomorphic image of (Z,+). There exists a ho¬ 
momorphism / of Z onto H. By the first isomorphism theorem, Z/Ker / ~ H. 
Since Ker f is a subgroup of Z, Ker / = r?,Z for some integer n > 0. Hence, 
H ~ Z/nZ for some integer n > 0, On the other hand, for any n > 0, nZ is a 
subgroup of Z and since Z is commutative, nZ is a normal subgroup of Z. There 
exists a natural homomorphism / from Z onto Z/nZ given by /(m) = m + nZ 
for all m € Z. This shows that Z/nZ is a homomorphic image of Z for all 
n > 0. Consequently, the homomorphic images of Z are the groups (up to iso¬ 
morphism) Z/nZ, n > 0. Now for n — 0, Z/nZ ~ Z and for n > 0, Z/nZ ~ Z n 
(Exercise 2, page 164). Therefore, we conclude that the homomorphic images 
of Z are the cyclic groups Z and Z n , n > 0. 

0 Exercise 2 If there exists an epimorphism of a finite group G onto the 
group Z§, show that G has normal subgroups of index 4 and 2 . 

Solution: Let / : G —» Zs be an epimorphism. Then by the first isomor¬ 
phism theorem, G/Ker / ~ Zs- Hence, G/Ker / is a cyclic group of order 8 . 
Thus, G/Ker / has a normal subgroup Hi of order 4 and a normal subgroup 
H 2 of order 2. By the correspondence theorem, there exist normal subgroups 
Afi and N 2 of G such that Ker f C N u Ker f ' ( C jV 2 , Afi/Ker f — Hi, and 
A r 2 /Ker / = H 2 . Thus, 

8 - |G/Ker /| = [G : Ker /] = [G : AfilfAfi : Ker /] = [G : Afi]4. 

This implies that [G : Afj] = 2. Similarly, [G : A^ 2 ] = 4. 

<) Exercise 3 Show that 4Z/12Z ~ Z 3 . 

Solution: Define / : 4Z —■> Z 3 by /(4n) = [n] for all 4n £ 4Z. One can show 
that / is an epimorphism. Then from the first isomorphism theorem, 4Z/Ker 
/ ~ Z 3 . Now Ker / = {An e 4Z | /(4n) ~ [0]} — {4n G 4Z | [n] = [0]} = 12Z. 

Exercise 4 Let G be a finite group and / be an automorphism of G such that 
for all a G G, /(a) = a if and only if a = e. Show that for all g <E G, there 
exists a £ G such that g — a~ l f{a). 

Solution: Let G = {aj, a 2 ,..., a n }. Let S' = {a^ 1 /(ai),..., a~ 1 /(a n )}. 
Then S C G. Next, we show that all elements of S' are distinct. Now a~ l f(di) — 
a 7 1 /( a i) and onl y f { a i)f { a j)~ l ~ a i a ] 1 if and only if f{a z a~ l ) = a^aj 1 
if and only if = e if and only if a t = a^. This shows that all elements of 

S are distinct and so |S'| = n. Thus, S = G. Let g G G. Then g £ S. Hence, 
g = a~ 1 /( a ) f° r some a G G. 
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Exercise 5 Let G be a finite group and / be an automorphism of G such that 
for all a G G, /(a) — a if and only if a = e. Suppose that / 2 — i G} where 
i G denotes the identity map. Prove that G is commutative. 

Solution: Let g G G. By Worked-Out Exercise 4, g — a -1 /(a) for some 
a e G. Then g = i G (g ) = / 2 (a - 1 /(a)) = /(/(a -1 /0))) = f{f(a~ l )f 2 (a)) = 
/(/(a) - 1 a) = /(g -1 ). This implies that /(g) — g -1 for all g 6 G. Let a,b G G. 
Then (a 6) -1 — /(a&) — /(a)/( 6 ) — a - 1 6 -1 — ( 6 a ) -1 and so ab — ba. Hence, G 
is commutative. 


0 Exercise 6 Let H be a subgroup of index 2 in a finite group G. If the order 
of H is odd and every element of G\H is of order 2, prove that H is 
commutative. 


Solution: Since [G : H] = 2, H is a normal subgroup of G. Now G — 
H U Hg, where g £ H. Then o(g) — 2 . Define / : G —» G by for all a G G, 
/(a) = gag -1 . Then / is an automorphism of G. Now / 2 (a) = /(/(a)) = 
/(gag -1 ) = g(gag - 1 )g -1 = g 2 ag -2 = a since g 2 = e. Hence, / 2 = i G . Since H 
is a normal subgroup of G, /(fi) = aha~ l G H for all h G H. Thus, / is also 
an automorphism of H. Let h, G H. Suppose f{h ) — h. Then gfig -1 = h or 
gh = hg. Since gh ^ H, o(gfi) = 2. Therefore, h 2 = g 2 h 2 = (gfi ) 2 = e. Since 
the order of H is odd, h 2 = e implies that h = e. Hence, f{h) — h if and only 
h — e. Thus, / is an autom -rphism of H such that / 2 = i G and /(/?,) — h if 
and only if h = e. By Worked-Out Exercise 5, H is commutative. 


0 Exercise 7 Show that Aut(Z n ) ~ U n . 


Solution: Define a :Aut(Z n ) —> U n by a(/) — /([ 1 ]) for all / G Aut(Z n ). 
Now m/([l]) — /([m]). Hence, /(.[m]) — [0] if and only if m is divisible by n. 
Thus, °(/([l])) = n. This implies that /([1]) G U n and so a is well defined. Let 
/,g G Aut(Z n ). Then a(/og) = (/og)([l]) = /(g([l])). Suppose g([l]) = [fe]. 
Then a(f o g) = /([fc]) = kf{[ 1]) - A[l]/([1]) - [k] f([ 1]) = /([l])g([l]) - 
a(f)a(g). Hence, a is a homomorphism. Now 

Ker a = {/ e Aut(Z„) | q(/) = [ 1 ]} 

= {/ S Aut(Z n ) | /([ 1]) = [1]} 

= {/ € Aut(Z n ) | / is the identity map}. 


Hence, a is a monomorphism. Finally, we show that a is onto U n . Let [t] G U n . 
Then t and n are relatively prime. Define / : Z n —> Z n by /([m]) = [ml] for all 
[to] G Z n . Let [r], [s] € Z n . Suppose [r] — [s]. Then r — s = nq for some q G Z. 
Thus, rt — st = nqt. Hence, [rt] = [si], proving that / is well defined. Clearly / 
is a homomorphism. Suppose f([r]) = /([s]). Then [rt] — [si] and so n divides 
rt — st = (r — s)i. Since i and n are relatively prime, n divides r — s. Therefore, 
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[r] = [s]. This implies that / is one-one. Now let [r] E Z n . Since gcd(n,t) = 1, 
there exist p,g £ Z such that 1 = tp + nq. Hence, r = ptr + qnr. This implies 
[r] = [ptr). Now \pr] E Z n . Thus, f(\pt] = [ptr ] = [r]. We therefore find that / 
is onto. Hence, f E Aut(Z n ). Now a(f) = /([l]) = [t] shows that a is onto U n . 
Thus, a is an isomorphism. Consequently, Aut(Z n ) ~ U n . 


5.2.2 Exercises 

1 . Let R* be the multiplicative group of all nonzero real numbers and T — 
{1,-1}. Then T is a subgroup of R*. Prove that the quotient group R*/T 
is isomorphic to the multiplicative group R + of positive real numbers. 

2. For any positive integer n, prove that ZjnZ ~ Z n . 

3. Show that 8Z/56Z ~ Z 7 . 

4. Let G be a group and A and B be normal subgroups of G such that 
A~ B. Show by an example that G/A 9 ^ G/B. 

5. For any two positive integers m, n such that gcd (m,n) = 1, prove that 
mZjmnL ~ Z n . 

6 . Let G be the group of symmetries of the square and FQ the Klein 4-group. 

Show that the mapping /:(?—> K 4 defines a homomorphism of G onto 
FC 4 , where /(r 180 ) = /(r 36 o) = e, /(r 90 ) = /(r 27 0 ) = a, f(h) = f(v) = 6 , 
f{di) = f(d 2 ) = c. 

7. In Exercise 6 , exhibit the one-one inclusion preserving correspondence 
between the subgroups of G containing Z(G ) and the subgroups of FQ. 

8 . Let G and K 4 be as in Exercise 6 . Let g be the natural homomorphism of 
G onto G/Z{G ), where Z(G) is the center of G. Prove that Z{G) ~ Ker 
/ and exhibit the isomorphism h of G/Z(G ) onto K 4 such that f = hog. 

9. Show that Z& is not a homomorphic image of Z 15 . 


10. Show that Z 9 is not a homomorphic image of Z 3 x Z 3 . 

11 . Show that if there exists an epimorphism from a finite group G onto the 
group Z 15 , then G has normal subgroups of indices 5 and 3, respectively. 


12. Partition the following collection of groups into subcollections of groups 
such that any two groups in the same subcollection are isomorphic. 


(i) ( z >+)> (ii) ( z 6,+), (iii) ( z 2 ,+), O) S 2 , (v) Ss, (vi) (17Z.+), (vii) 
(355,+), (vii) (Q, +), (ix) (R, +), (x)(RV), (xi) (R + ,-), (xii) (Q*,-), 

(xiii) (C*,-), ( x i y ) (( 7r ))‘)i where R* denotes the set of nonzero real 
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numbers, Q* denotes the set of nonzero rational numbers, C* denotes 
the set of nonzero real numbers, R + denotes the set of positive real 
numbers, and ((ir) , -) is the cyclic subgroup of (R + , •) generated by i r. 

13. Show that 

(i) Aut(Z 5 ) ~ Z 4 . 

(ii) Aut(Zs) — Klein 4-group. 

14. Find all automorphisms of the group 

15. Show that |Aut(Z p )| = p — 1, where p is a prime. 

16. Prove that Inn(5s) ~ S 3 ~ Aut(5a). 

17. Determine Aut(5 4 ). 

18. Let G be a cyclic group of order n and cj> be the Euler (^function. Prove 
that |Aut(G)| = </>(n). 

19. Let G be a group such that Z{G ) = {e}. Prove that Z(Aut(G)) — {e}. 

20. Let G be a group and if be a subgroup of G. H is called a characteristic 
subgroup of G if /(if) C H for all / € Aut(G). 

(i) Show that every characteristic subgroup of G is a normal subgroup of 

G. 

(ii) Give an example of a group G and a subgroup H such that if is a 
normal subgroup of G , but if is not a characteristic subgroup of G. 

(iii) Show that Z{G) is a characteristic subgroup of G. 

(iv) Let if and K be characteristic subgroups of G. Show that HK and 
if H if are characteristic subgroups of G. 

(v) Let if and K be subgroups of G such that if C if. Show that if K 
is a normal subgroup of G and if is a characteristic subgroup of (7, then 
H is a normal subgroup of G. 

(vi) Let if and if be subgroups of G such that if C if. Show that if if 
is a characteristic subgroup of if and if is a characteristic subgroup of 
G , then if is a characteristic subgroup of G. 

(vii) Suppose G is cyclic. Show that every subgroup of G is a character¬ 
istic subgroup of G. 

21. Show that the only characteristic subgroups of (Q, +) are {0} and Q. 
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22. Which of the following statements are true? Justify. 

(i) Any epimorphism of Z onto Z is an isomorphism. 

(ii) Any epimorphism of a group G onto G is an isomorphism. 

(iii) The quotient group 4Z/64Z has five subgroups. 

(iv) Z 5 has five homomorphic images. 

(v) 2Z/6Z is a subgroup of Z/ 6 Z. 

(vi) There exist four subgroups of Z which contain 10Z as a subgroup. 

(vii) Let G and H be two groups, A be a normal subgroup of G, and B 
be a normal subgroup of H. If G ~ H and A ~ B, then G/A ~ H/B. 


5.3 The Groups D 4 and Q$ 

In Section 5.1, we saw that there are two types of groups of order 4 and two 
types of groups of order 6 . In this section, we wish to classify all noncommuta- 
tive groups of order 8 . We will consider finite commutative groups in Chapter 
9. First we introduce two groups D 4 and Q 8 and study these groups in de¬ 
tail. The study of these groups will eventually lead us to the classification of 
noncommutative groups of order 8 . 


Definition 5.3.1 A group G is colled a, dihedral group of degree 4 if G is 
generated, by two elem.ents a and b satisfying the relations 

o(a) = 4, o ( 6 ) = 2, and ba = at’b. 


Example 5.3.2 Let T be the group of all 2 x 2 invertible matrices over R 
under usual m.atrix multiplication. Let G be the subgroup ofT generated by the 
matrices 


A = 


0 1 

-1 0 


and B = 


0 

1 


1 

0 


Then o(A) = 4 and o(5) = 2. Now 


BA = 


[or 


0 1 ' 


1- 

O 

1—H 

1 

1 _ 

—1 

0 

i-H 


1 

0 

1 _ 


-1 

i—l 

O 


and 


A 3 B = 


" 0 -1 ’ 


J 

T—i 

O 


1 

)—* 

0 

-J 

1 0 


1 0 


0 1 


Thus, BA = A 3 B. Hence, G is a dihedral group of degree 4. 
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Example 5.3.3 Consider S A . Let G be the subgroup of S A such that G is gen¬ 
erated by the permutations 

a = (1 2 3 4) and b = {2 4). 

Then a 2 = (1 3) o (2 4), a 3 ~ (1 4 3 2), a 4 = e, b 2 = e, and b o a — (1 4) o (2 
3 ) = a 3 o b. Hence, o(a) = 4, o( 6 ) = 2 , and b o a = a 3 o b. Thus, G is a dihedral 
group of degree 4. 

The following theorem reveals some interesting properties of D A . These 
properties are similar to the properties listed in Example 4.1.18 for D 8 . 

Theorem 5.3.4 Let G be a dihedral group of degree 4 generated by the ele¬ 
ments a and b such that 

o (a) — 4, o ( 6 ) -- 2, and ba = a 3 b. 

Then the following assertions hold. 

(i) Every element of G is of the form a' l b J , 0 < i < 4, 0 < j < 2. 

(ii) G has exactly eight elements, i.e., |G| = 8 . 

(in) G is a, noncomm.utative group. 

Proof. (i) Since G = (a, b) , 

G = {a il b jl a i 2 b j2 ■ ■ ■ a in b jn \ i u j t 6 Z, 1 < t < n, n G N}. 

Since ba — a 3 b , it follows that every element of G is of the form a n b m , where 
n,m E Z. Now a 4 = e, b 2 — e, a -1 = a 3 , and 6 _1 = b. This implies that every 
element of G is of the form a 1 IP , 0 < i < 4, 0 < j < 2 . 

(ii) By (i), every element of G is of the form ahbP , 0 < i < 4, 0 < j < 2 . 
Thus, |G| < 8 . Since o(a) = 4, it follows that e,a,a 2 ,a 3 are distinct elements 
of G. Then b,ab, a 2 b,a 3 b are also distinct elements of G. Also, since aT 1 = a 3 , 
6 -1 = b , and a ^ b ^ e, 

{e, a, a 2 , a 3 } D { 6 , ab , a 2 6 , a 3 6 } — (f. 

Thus, G = {e, a, a 2 , a 3 , 6 , ab, afb, a 3 b}. Hence, G has eight elements. 

(iii) Suppose ab = ba. Then ab = a 3 b. This implies that a 2 = e, which is a 
contradiction. Hence, ab ^ ba, proving that G is noncommutative. ■ 

It is easy to see that any two dihedral groups of degree 4 are isomorphic. 
Hence, there exists only one dihedral group (up to isomorphism) of degree 4. 
We denote a dihedral group of degree 4 by D A . 

We now describe all subgroups of D A and draw the lattice diagram of sub¬ 
groups of D a . 

“Public. 7feaiAe»tafLca/ 
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In D 4 , 

o(a) = 4, o(a 2 ) = 2, o(a 3 ) = 4, 0 ( 6 ) = 2, 

( ab) 2 — abab = aa 3 bb = e, 

{a 2 b) 2 = a 2 ba 2 b = a 2 (a 3 b)ab — abab - e, 

(a 3 6) 2 ~ a 3 ba 3 b = a 3 (a 3 6 )a 2 6 — a 2 ba 2 b - e. 

From this, it follows that H\ — { e,a 2 }, H 2 = {e, 6 }, H 3 = {e,a 6 }, — 

{e,a 2 &}, and ^5 = {e,a 3 6 } are subgroups of order 2. By Lagrange’s theorem, 
D\ has no subgroups of order 3, 5, 6 , or 7. Now 

T\ = {e,a,o 2 ,a 3 } 

T 2 = {e, a 2 , 6 , a 2 6 } 

T 3 = {e, ab , a 2 , a 3 6 } 

are subgroups of order 4. We ask the reader to verify that {e}, Hi> H 2 , # 3 , 
77 4 , 7 / 5 , Ti, T 2 , T 3 , and Z ) 4 are the only subgroups of Z) 4 . Hence, the lattice 
diagram of the subgroup lattice of _D 4 is the following: 



It is interesting to note in D 4 that H$ is a normal subgroup of T 3 and T 3 
is a normal subgroup of Z) 4 , but is not a normal subgroup of Z) 4 . We also 
note that every nontrivial subgroup of D 4 is of order 2 or 4. Therefore, every 
nontrivial subgroup of Z ) 4 is commutative. However, since T 2 is a nontrivial 
subgroup of D 4 and T 2 is not cyclic, it follows that not every nontrivial subgroup 
of D 4 is cyclic. Finally, we also note that D 4 is isomorphic to Sym, the group 
of symmetries of a square (page 69). This follows from Theorem 5.3.4 and the 
group table of the group of symmetries of the square given on page 70. 

Next, we consider Q 8 . 


Definition 5.3.5 A group G is called a quaternion group if G is generated, 
by two elements a , b satisfying the relation 

o (a) — 4, a 2 = 6 2 , and ba = a 3 b. 
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Example 5.3.6 Let T be the group of all 2 x 2 invertible matrices over C 
under usual m.atrix multiplication. Let G be the subgroup ofT generated, by the 
matrices 


A = 


0 1 

-1 0 


and B 


0 i 
i 0 


Then o(A) = 4 and 


A 2 


-1 0 

0 -1 



Now 


and 


BA — 


A 3 B = 


0 i 
i 0 


0 -1 

1 0 


o 

H- 1 

_i 


o 

■ 

1 

-1 0 


-1 

• <S> 

O 

_i 

i 

* CS5 

o 

1_ 


’ -i 0 ' 

i 0 


0 i 


Thus, BA = A 3 B. Hence, G is a quaternion group. 


We leave the proof of the following theorem, which is similar to the proof 
of Theorem 5.3.4, as an exercise. 


Theorem 5.3.7 Let G be a quaternion group generated by the elements a and 
b such that 

o(a) =4, a 2 = 6 2 , a,nd ba — a 3 b. 

Then the following assertions hold. 

(i) Every element of G is of the form, ahb 3 , 0 < i < 4, 0 < j < 2. 

(ii) G has exactly eight elements, i.e., |G| — 8. 

(Hi) G is a noncomm,utative group. ■ 

It is easy to see that any two quaternion groups are isomorphic. Hence, 
there exists only one quaternion group (up to isomorphism) and we denote it 
by Qs- 

Next, we determine all subgroups of Qs. 

Let Qs = (a, b ) , where o(a) = 4, a 2 — 6 2 , and ba = a 3 b. Then 

Qs — {e, a, a 2 , a 3 , 6, ab : a 2 b, a 3 b}. 


In Qs, 


o(a) = 4, o(a 2 ) — 2, o(a 3 ) — 4, o( 6 ) = 4. 


(a 6) 2 — abab = aa 3 bb = b 2 — a 2 . 


IPiLfic .: 


Now 
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Thus, o(a&) = 4. Also, 


(a 2 &) 2 = a 2 ba 2 b = a 2 (a 2 b)ab = a 5 bab = abab 


and 

(a 3 6) 2 = a 2 ba 2 b = a 2 (a 2 b)a 2 b — a 2 ba 2 b. 

Hence, o (a 2 b) = 4 and o (a 3 b) = 4. It now follows that Hq — {e}, H\ = {e,a 2 }, 
H 2 = {e, a, a 2 , a 3 }, Hg = {e, ab , a 2 , a 3 6}, and H 4 — {e, 6, a 2 , a 2 b} are subgroups 
of Qg. We ask the reader to verify that Hq, H \, H 2 , Hg, H 4 , and Qg are the 
only subgroups of Qg. Thus, the lattice diagram of the subgroup lattice of Qg 
is the following: 


Hg 



H a 


Since [Qs : # 2 ] — [Qs : # 3 ] — [Qs : H A \ = 2, H 2 , Hg, and H A are normal 
subgroups of Qs- Now ba 2 b~ l = baab~ l = a 2 bab~ l = a 2 a 2 bb~ l = a 2 G H\. 
Since Qs = (a, 6) , H\ is a normal subgroup of Qs- Thus, every subgroup of 
Qg is a normal subgroup of G. It is also interesting to observe that all proper 
subgroups of Qs are cyclic. 


Theorem 5.3.8 D A ^ Qg. 


Proof. We note from the above discussion that Qs contains six elements of 
order 4 while D 4 contains only two elements of order 4. Hence, D 4 qk Qg. ■ 

The next theorem classifies all noncommutative groups of order 8. 

Theorem 5.3.9 There exist (up to isomorphism) only two noncommutative 
nonisom,orphic groups of order 8. 


Proof. Let G be a noncommutative group of order 8. Since |G| is even, 
there exists an element u £ G, u ^ e, such that u 2 = e. If x 2 ~ e for all x G G, 
then G is commutative, a contradiction. Thus, there exists a £ G such that 


a 


7 ^ e. Since o(a)| 8 , o(a) = 4 or 8 . If o(a) = 8 , then G is cyclic and hence 


commutative, a contradiction. Thus, o(a) = 4. Let H = {e,a, a 2 , a 3 }. Then H 
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is a subgroup of G of index 2 and so H is a normal subgroup of G. Let b £ G 
be such that b ^ H. Then G = H U Hb and H O Hb = <p. This implies that 

G = {e, a, a 2 , a 3 , 6, a6, a 2 b , a 3 6} = (a, b) . 

Now 6 a 6 _1 E H. If bab~ l = e, then a = e, a contradiction. Thus, bab~ l 7 ^ e. 
If bab ~ 1 = a, then ah = ba and hence G is commutative, a contradiction. If 
bab~ l — a 2 , then ba 2 b~ l — ( bab~ 1 ) 2 = a 4 = e and so a 2 — e, a contradiction. 
Therefore, 6 a 6 _1 = a 3 and so ba = a 3 b. Since \G/H\ = 2 and b £ H, o [Hb) = 2 . 
Hence, b 2 E H. If b 2 = a or a 3 , then o(6) = 8 and so G is commutative, a 
contradiction. Therefore, either b 2 — e or b 2 = a 2 . It now follows that if G is a 
noncommutative group of order 8, then either 

G — (a, b) such that o (a) =4, o (6) — 2 ,and 6a = a 3 6 


or 

G = (a, 6 ) such that o (a) =4, 6 2 = a 2 , and ba = a 3 b. 

In the first case, G ~ D 4 and in the second case, G ~ Qs - ■ 

5.3.1 Worked-Out Exercises 
0 Exercise 1 Find Z(D 4 ). 

Solution: It is known that Z(D 4 ) is a normal subgroup of D 4 . Now 
D 4 has five normal subgroups: D 4 , {e}, H\ = {e,a 2 }, Ti = {e,a,a 2 ,a 3 }, 
T 2 = {e, a 2 , 6 , a 2 6 }, T 3 — {e, a 6 , a 2 , a 3 6 }. Since ab 7 ^ 6 a, D 4 , Ti, and T 2 can¬ 
not be Z(D 4 ). If [ab)b — 6 (a 6 ), then a = ( 6 a )6 = a 3 6 2 = a 3 and so a 2 = e, 
a contradiction. Hence, T 3 7 ^ Z(D 4 ). Now a 2 6 = a 6 6 — a 3 (a 3 6 ) = a 3 ( 6 a) = 
( 6 a)a = 6 a 2 . Hence, a 2 E Z(D 4 ). Thus, Z(D 4 ) = {e,a 2 } = H\. 

<0 Exercise 2 Find Inn(D 4 ). 

Solution: By Corollary 5.2.18, Inn(D 4 ) ~ D 4 /Z{D 4 ). Now D 4 /Z(D 4 ) is a 
group of order 4 and 

D 4 /Z(D 4 ) = {eZ(D 4 ), aZ{D 4 ), bZ(D 4 ), abZ(D 4 )}. 

Since a 2 E Z(D 4 ), 6 2 — e, and (a 6) 2 = e, we find that each nonidentity element 
of D 4 /Z(D 4 ) is of order 2. Hence, D 4 /Z{D 4 ) — K 4 , the Klein 4-group. 

5.3.2 Exercises 

1. In D 4 , find subgroups H and K such that K is a normal subgroup of H 
and H is a normal subgroup of D 4 , but K is not a normal subgroup of 
D 4 . 
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2. Show that Qg is the union of three subgroups each of index 2. 

3. Find all homomorphic images of D 4 . 

4. Find all homomorphic images of Q%. 

5.4 Group Actions 

As previously mentioned, the theory of groups first dealt with permutation 
groups. Later the notion of an abstract group was introduced in order to ex¬ 
amine properties of permutation groups which did not refer to the set on which 
the permutations acted. However, one is primarily interested in permutation 
groups in geometry. Also, permutation groups are used in counting techniques 
that are important in finite group theory. An example of this can be seen in 
the proof of Lagrange’s theorem. We extend the notion of a permutation on a 
set to a group action on a set. We use the notion of a group action on a set to 
determine, via counting techniques, important properties of finite groups. 

Let G be a group and S a nonempty set. A (left) action of G on S is a 
function • : G x S —» S (usually denoted by -(g,x) —* g ■ x) such that 

(i) { 9192 ) ■ * = 9\ • (92 ‘ z), and 

(ii) e • x — x, where e is the identity of G 
for all x E S, < 71 , g 2 E G. 

Note: If no confusion arises, we write gx for g ■ x. 

If there is a left action of G on S, we say that G acts on S on the left and 
S is a G-set. 

Example 5.4.1 Let G be a permutation group on a set S. Define a left action 
of G on S by 

ax — <j(x) 

for all a E G, x E S. Let x € S. Now ex = e(x) = x , where . e is the identity 
permutation on S. Leta\,a 2 € G. Then (a\oa 2 )‘X = {aioa 2 ){x) = a\{a 2 {x)) = 
a 1 • ( 02 ( 2 ;)) — o\ ■ ((72 • x). Hence, S is a G-set. 

Example 5.4.2 Let G be a group and H be a norm.al subgroup of G. Define a 
left action of G on H by 

C9, h) -»> ghg~ l 

for all g £ G, h e H. We denote this by g ■ h — ghg~ l . Let h € H. Now 
e • h — ehe~ l = ehe = h. Let gi,g 2 G G. Then {gig 2 ) * h = (£ 132 )^( 2152 )“ 1 = 
{ 9 \ 92 )h( 92 l gf l ) = g\{92hg2 l )gf l = gi(g 2 ■ h)gf l = g x • (g 2 ■ h). Hence, H is a 
G-set. 
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Theorem 5.4.3 Let S be a G-set, where G is a group and S is a nonempty 
set. Define a relation ~ on S by for all a,b E S, 

a ~ b if and only if ga = b for some g E G. 

Then ~ is an equivalence relation on S. 

Proof. Since for all a E S, ea = a, a ~ a for all a E S. Thus, ~ is reflexive. 
Let a,b,c E S. Suppose a ~ b. Then ga = b for some g E G, which implies that 
g~ l b = g~ l (ga) = ( g~ 1 g)a = ea = a. Hence, b ~ a and so ~ is symmetric. 
Now suppose a ~ b and 6 ~ c. Then there exist 51 , gi E G such that g\a = b 
and gfio — c. Thus, ( g 2 g\)a — g 2 (gia) = 92 b = c and so a ~ c. Hence, ~ is 
transitive. Consequently, ~ is an equivalence relation. ■ 

Definition 5.4.4 Let S be a G-set, where G is a group and S is a nonempty 
set. The equivalence classes determined by the equivalence relaiion of Theorem, 
5.4.3 are called the orbits of G on S. 

For a £ S, the orbit containing a is denoted by [a]. 

Lemma 5.4.5 Let G be a group and S be a G-set. For all a E S, the subset 

G a — {9 £ G | ga — a} 


is a subgroup of G. 

Proof. Let a E S. Since ea = a, e E G a and so G a 7 ^ (p . Let g,h E G a - Then 
ga = a and ha — a. This implies that ( gh)a = g(ha) = ga = a and so gh E G a . 
Now h~ l a = h~ 1 (ha) = ( h~ l h)a = ea = a. Thus, h~ l E G a . Hence, G a is a 
subgroup of G. ■ 

The subgroup G a of Lemma 5.4.5 is called the stabilizer of a or the 

isotropy group of a. 

Lemma 5.4.6 Let G be a group and S be a G-set. For all a E S, 

[G : G a \ = | [a] | . 


Proof. Let a E S. Let C be the set of all left cosets of G a in G. Now 
[a] = {b E S | a ~ b} = {b E S \ ga = b for some g E G] = {ga \ g E G}. 
We now show that there exists a one-one function from C onto [a]. Define 

/ : £ —> [a] 

iPufic. TfeaiAewtafLea/ 
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by 


f(gG a ) = ga 


for all gG a G £■ Let pi, 52 £ G. Then g\G a — g 2 G a if and only if g 2 l g\ € G a 
if and only if g^igia) = (^ 1 pi) a = a if and only if g\a — g 2 d. Thus, / is a 
one-one function from C into [a]. Let b E [a]. Then there exists g £ G such that 
ga — b. Thus, f(gG a ) = ga = b. This implies that / is onto [a]. Consequently, 
[G:G a ] = \C\ = \[a]\.m 


Theorem 5.4.7 Let G be a group and S be a G-set. If S is finite, then 

|S| = £[G:G„], 

a€ A 


where A is a subset of S containing exactly one element from, each orbit [a . 


Proof. By Theorem 5.4.3, S can be partitioned as the union of orbits. 
Therefore, 

S — [fl] - 


Hence, 

|s| = £IMI = £[g : G a ) by Lemma 5.4.6. ■ 

cz£-A 


Theorem 5.4.8 Let G be a group and S be a G-set. Then the left action of 
G on 1 S induces a horn,omorphism from, G onto A(S), where A{S) is the group 
of all permutations of S. 


Proof. Let g E G. Define r g : S —■» S by T g (a) = ga for all a E S. Let a, 6 € S. 
Then T g (a) = r g (b) if and only if ga = gb if and only if a — b. Therefore, r g 
is a one-one function. Now 6 = g(g~ 1 b) = r g (g~ l b) and g~ l b E S. This shows 
that r g is onto S. Thus, r g 6 A(S). Let g\, g 2 £ G. Then T gig 2 {a) — (gig 2 )a = 
gi(g 2 a) = r gi (g 2 a) = T gi {r g 2 {a)) = (r gi °T 52 )(a) for all a 6 S. This implies that 
^9192 T gi 0 t 92 • Define 

if : G -> A(S) 


by 


^(g) = T g 

for all g € G. Then if is a function. Now if{g\g 2 ) — t 5i52 = r gi o r g2 = 
if(gi) o if{g 2 ) for all gi, g 2 E G. This proves that if is a homomorphism. ■ 


The following corollary, which is known as the extended Cayley’s theorem, 
follows from the above theorem. 


Corollary 5.4.9 Let G be a group and H be a subgroup of G. Let S — {aH \ 
a E G}. Then there exists a hom,om,orphism, if from G into A(S) (the group of 
all permutations on S) such that Ker if C H. 
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Proof. First we note that S is a G-set, where the left action of G on 
S is defined by g(aH ) — (pa) 77 for all g £ G. This left action induces the 
homomorphism ip of Theorem 5.4.8. Now 

Ker ip = \q £ G \ ip(a) = r a — the identity mapping on S] 

= {g € G | T g {aH ) - aH for all aH £ S} 

= {p e G | g(aH) = aH for all aH £ 5}. 

Let g £ Ker ip. Then g{aH) = aH for all aH £ S. In particular, gH — H. 
Thus, g £ H. Hence, Ker ip C H. ■ 

Corollary 5.4.10 Let G be a finite group and H be a proper subgroup of G of 
index n such that |G| does not divide n! Then G contains a nontrivial normal 
subgroup. 


Proof. From Corollary 5.4.9, Ker ip C H and GJ Ker ip is isomorphic to a 
subgroup of S n , where ip is as defined in Corollary 5.4.9. Therefore, |G/Ker ip\ 
divides n\ But |G| does not divide n! Hence, |Ker ip\ 1, proving that Ker ip 
is a nontrivial normal subgroup of G. 

Definition 5.4.11 Let G be a group and S be a G-set. Let a £ S, g £ G. 
Then a is called fixed by g if ga = a. If ga = a for all g £ G, then a is called, 
fixed by G. 

Theorem 5.4.12 (Burnside) Let S be a finite nonempty set and G be a finite 
group. If S is a G-set, then the number of orbits of G is 

where F(g) is the number of elements of S fixed by g. 


Proof. Let T = {(p, a) £ G x S | ga — a}. Since F(g) is the number of 
elements a £ S such that (p, a) £ T, it follows that \T\ = Y^gGG -F(p)- Also, \G a \ 
is the number of elements g £ G such that (p, a) £ T. Hence, |T| = Yla^s l^a| • 
Let S — [ai] U [ 02 ] U - • • U [a*], where {[ai], [ 02 ]; • • •, [ a k]} is the set of all 
distinct orbits of G on S. Then 


^ 2 ^( 9 ) —.^2 l^ a l + l^ a l + -X l^ a l- 

g&G aG[ai] a€[a2] a.G [a^] 


Suppose a, b are in the same orbit. Then [a] = [b] and \G : G a ] — |[a]| = |[6] 
[G : Gb\- This implies 

|G| |G| 


\G a 


A 


Gb 
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and so |G a | = |Gj,| . Thus, 

= |[ a l]| \G ai \ + | [02] | |Go 2 | H-b \[a>k]\ \Ga k \ 

— _M_ I q I _i_ l G l 1(7 I j_ 1 1^1 I Q 

~ |G ai | |Cra i 1 + |G a2 | 1^2 I + + |G a J 

= k\G\, 

where k is the number of distinct orbits. Consequently, 


k - 



E F (s)- ■ 

g(=G 


5.4.1 Worked-Out Exercises 

<)> Exercise 1 Let S be a finite G-set, where G is a group of order p n (p a 
prime). Let So = {a E S | ga — a for all g E G}. Show that 

\S\ = P |5 0 | - 


Solution: By Lemma 5.4.7, 

|5| = E[G:G a ], 

a&A 


where A is a subset of S containing exactly one element from each orbit [a] of 
G. Now a E So ^ and only if ga — a for all g E G, i.e., if and only if [a] = {a}. 
Hence, 


|si = !Soi+ E 


M 

ic„r 


Since |G a | ^ |G| for all a 6 A\So, is some power of p for all a E A\So. 
Thus, is divisible by p, proving that \S\ = p |So| - 


0 Exercise 2 Let S' be a finite G-set, where G is a group of order p n (jp a 
prime) such that p does not divide \S \. Show that there exists a E S such 
that a is fixed. 


Solution: Let So = {& € S j ga = a for all g E G}. By Worked-Out 
Exercise 1, \S\ = p |So| . Since p does not divide |Sj , p does not divide |So| . 
Thus, |So| 7 ^ 0. This shows that there exists a E So- Thus, a is fixed by G. 

0 Exercise 3 Let G be a finite group and H be a subgroup of G such that 
« • 

\H\ = p , where p is a prime and k is a nonnegative integer. 

(i) Show that 

[G : H] = p [N(H) : H] t 
where N(H ) = {g E G j gHg~ l — H}. 

(ii) If p|[G : H], show that N(H) ^ H. 
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Solution: (i) Let S — {xH \ x € G}. Define a left action of H on S by 
h{xH) — ( hx)H for all h E H, xH E S. Then S is an H- set. Let So — {^ff £ S 
| h{xH) = xH for all h E H}. By Worked-Out Exercise 1, \S\ = p |5o| . Now 
xH E S 0 if and only if h(xH ) = xH for all h E H if and only if x~ 1 hx E H 
for all h E H if and only if x~ l Hx C H. Now \x~ l Hx\ = \H\. Hence, xH E 5b 
if and only if x~ l Hx C H if and only if x~ l Hx = H (since H is finite and 
\x~ l Hx\ = |ff|) if and only if x E N(H). This shows that So is the set of all 
left cosets of H in N(H). Thus, |5o| = [N(H) : H]. Also, |5| = [G : H]. Hence, 
[G : H] = p [N(H) :H\. 

(ii) By (i), [G : H ] = p [ N(H ) : H}. Now p divides [G : H}. Thus, p divides 
[N(H) : H ]. Since [N{H) : H] > 1, it follows that N{H) ^ H. 

Exercise 4 Let G be a finite group. Let H be a subgroup of G of index p, 
where p is the smallest prime dividing the order of G. Show that H is a 
normal subgroup of G. 

Solution: Let 5 = {aH | a E G}. Since [G : H] = p , |5| = p. Thus, 
|A(5)| = p!, where A(5) is the group of all permutations on 5. Define a left 
action of G on 5 by g(aH ) = ( ga)H for all g E G, aH E 5. Now e(aH ) = aH 
and (g\g 2 )aH = {{g\g 2 )a)H — gi(g 2 aH). Hence, 5 is a G-set. Now the left 
action induces a homomorphism ip : G —> A(S ) defined by i/'(< 7 ) — T g ? where 
T g (aH) = (ga)H for all g E G, aH E S. Let g E Ker ip. Then g(aH) = eH for all 
aH E 5, in particular, gH = H. Hence, g E H. Thus, Ker ip C H. Now Gf Ker ip 
is isomorphic to a subgroup of A(S). Therefore, |G/Ker ip\ divides |A(5)| = p\ 
Let |G/Ker ip\ = n. Then n = [G : H][H : Ker ip] > p. Let n = p\p 2 • ■ - Pk, 
where pi are prime integers, i = 1 , 2 ,... ,k. Since pi divides \G\ and p is the 
smallest prime dividing the order of G, pi > p for all i = 1 , 2,..., k. Since n 
divides p!, we have each pi divides p\. Since each pi is a prime and p* > p, we 
must have i = 1 and p t = p. Thus, n = p. This implies that [H : Ker ip] = 1. 
Hence, H = Ker ip and so H is a normal subgroup of G. 

0 Exercise 5 Let G be a group of order pn, p a prime, and p > n. If if is a 
subgroup of order p in G, prove that H is a normal subgroup of G. 

Solution: Let 5 = {aH \ a E G}. Now \S\ = [G : H] = |^| = ^ = n. 
Define a left action of G on 5 by g(aH) = (ga)H for all g E G, aH E 5. Then 
5 is a G-set. Now the left action induces a homomorphism ip : G —> A(5) 
defined by ip(g) = r g , where r g {aH) = ( ga)H for all g E G, aH E 5. As in 
Worked-Out Exercise 4, Ker ip C H. Since |7f| = p, either Ker ip — {e} or 
Ker ip = H. If Ker ip = {e}, then G is isomorphic to a subgroup of A(S). This 
implies that |G| divides |A(5)|, i.e., pn|n! Therefore, p|(n — 1)! Since p > n, p 
does not divide (n — 1)! Thus, Ker ip = H. Hence, H is a normal subgroup of 
G. 


~Pu*uc- 7feaiAe»tafLca/ 



5.4. GROUP ACTIONS 


178 


Exercise 6 Let G be a group. Show that G is isomorphic to a subgroup of 
A{G). (This is Cayley’s theorem. Here we want to prove this result by 
the group action method.) 

Solution: G is a G-set, where the left action of G on G is defined by the 
group operation. This left action induces a homomorphism ip : G — » A(G) 
defined by ip{g) = T g , where T g {a ) = ga for all a, g £ G. Now Ker ip = {g (z G 
j r g = identity permutation on G} — {g G G \ ga — a for all a € G} = {e}. 
Hence, ip is a monomorphism. 

0 Exercise 7 Let G be a group of order 2m, where m is an odd integer. Show 
that G has a normal subgroup of order m. 

Solution: By Cayley’s theorem, G is isomorphic to a subgroup H of A(G ), 
where the isomorphism ip : G —> A{G) is given by ip{g) — r g , r g (a) — ga for all 
a, g £ G. Since G is of even order, there exists g E G such that o (g) = 2. Now 
r g {a ) — ga and r g (ga) = g 2 a — a. Hence, r g is the product of transpositions 
of the form (a ga). Since |G| = 2m, the number of transpositions appearing 
in the factorization of r g is m. Thus, r g is an odd permutation. Therefore, H 
contains an odd permutation. Define 

by for all a € H, 


f 0 ) 


1 if a is an even permutation 
— 1 if cr is a odd permutation 


where {1, —1} is a group under multiplication. Then f is an epimorphism of 
H onto {1,-1}. Hence, 

H/Ker f ~ { — 1,1}. 


Thus, 


2 = |{ —1.1}| = |V/Ker f\ 


I Ker f | 


2m 

I Ker /1 ■ 


Hence, |Ker /| = m. Consequently, H contains a normal subgroup of order m 
and so G contains a normal subgroup of order m. 


5.4.2 Exercises 

1. Show that Is = {1,2,3} is a 53-set, where the left action is defined by 
era = cr(a) for all a £ 53, a € Is. Find all distinct orbits of Ss- Find Gi, 
G2, and G3. 


2 . 


Let if be a subgroup of order 11 and index 4 of a group G. Prove that 
H is a normal subgroup of G. 



5.4. GROUP ACTIONS 


179 


3. Let if be a subgroup of a group G of index n. If H does not contain 
any nontrivial normal subgroups of G, prove that H is isomorphic to a 
subgroup of S n . 

4. Let G = GL(2,H) and S = R 2 . Show that S' is a G -set under the left 
action defined by 

(x, y ) = (ax + by, cx + dy) 
for all a b gG, (x,y)GR 2 . 

5. Let G be a group of order 77 acting on a set S of 20 elements. Show that 
G must have a fixed point. 

6 . Let G be a group. The left action of G on the set G is defined by 
conjugation, i.e., (g,x) —> gxg~ l for all g,x G G. Show that the kernel of 
the homomorphism -0 : G —> A(G) induced by this action is Z(G). 

7. Let G be a group of order 80 such that G has a subgroup of order 16. 
Show that G is not a simple group. 

8 . Show that a group of order 22 is not a simple group. 

9. Show that there are no simple groups of orders 6, 10, 14, 26, 34, and 58. 

10. Show that a group of order 8 cannot be a simple group. 

11. Show that a simple group of order 63 canpot contain a subgroup of order 

21 . 

12. Let G be a group of order 70 such that G has a subgroup of order 14. 
Show that G has a nontrivial normal subgroup. 
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Arthur Cayley(1821-1895) was born 
on August 16, 1821, in Cambridge, England. 
He was the second son. He entered Trin¬ 
ity College at the age of 17, as a pensioner. 
In 1842, he graduated as senior wrangler. 
Later he went to a law school and in 1849 
he became a lawyer. As a lawyer, he made 
a comfortable living and in fourteen years, 
during which he practiced his law profession, 
he wrote approximately 300 mathematical 
papers. 

In 1863, Cayley was elected to the new 
Sadlerian chair of pure mathematics at Cam¬ 
bridge, where he remained until his death. 
He died on January 26, 1895. 

For most of his life, Cayley worked on mathematics, theoretical dynamics, and 
mathematical astronomy. In 1876, he published his only book, Treatise on Elliptic 
Functions. Cayley wrote 966 papers; there are thirteen volumes of his collected papers. 

Cayley’s mathematical style was terse. He usually wrote out his results and pub¬ 
lished them without delay. He, along with J. J. Sylvester, his lifelong friend, is con¬ 
sidered to be the founder of invariant theory. He is also responsible for matrix theory. 
The square notation used for determinants is due to Cayley. He proved many impor¬ 
tant theorems of matrix theory, such as the Cayley-Hamilton theorem. He is one of 
the first, mathematicians to consider geometry of more than three dimensions. 

In 1854, Cayley published, “On the theory of groups depending on the symbolic 
equation 6 n = 1.” In this paper, he considered a group as a set of symbols, 1 ,a,j3, ..., 
all of them different and such that the product of any two of them (no matter in what 
order), or the product of any one of them into itself, belongs to the set. This formula¬ 
tion of a group as a set of symbols and multiplications is different from the formulation 
considered by the earlier mathematicians. The paper is generally regarded as the ear¬ 
liest work on abstract group theory and Cayley is regarded as the founder of abstract 
group theory. He is best known for the theorem that every finite group is isomorphic 
to a suitable permutation group. In his article of 1854, he introduced a procedure 
for defining a finite group by listing its elements in the form of a multiplication table, 
known as a Cayley table. Cayley also proved a number of important theorems. 







Chapter 6 


Direct Product of Groups 


6.1 External and Internal Direct Product 

In Section 2.1, Exercise 25, we defined the direct product G x H of two groups 
G and H. In this section, we extend this concept to any finite family of groups 
and obtain their basic properties. 

The notion of a direct product is used to factor a group into a product 
of smaller groups. This factorization gives structural properties of a group. 
In some cases, it allows for the complete characterization of a certain type of 
group. In Chapter 9, the concept of direct product is used to give a complete 
system of invariants for a finitely generated Abelian group, i.e., a finite set of 
positive integers which implies the isomorphism of any two finitely generated 
Abelian groups that have this set of integers. 

Recall that I n = {1,2,..., n}. 

Let {Gi | i £ I n ] be a family of groups. Let 

G — G\ x G 2 x • ■ ■ x G n = {(&i, CL 2 -, ■ ■ ■, a n ) | a i £ Gi } i £ I n }■ 

Define * on G as follows: for all (ai,< 22 ,..., a n ), ( 61 , 62 ,, b n ) € G 

Ol, & 2 , • • • > a n) * (^ 1 ) ^ 2 ? ■ • • 1 ^n) = (^ 1 ^ 1 ? a 2 ^ 2 ) • ■ • > a n^n)• 

In the following theorem, we show that * is a binary operation on G and 
that the set G together with the binary operation * is a group. We also obtain 
several important properties of G. 

Theorem 6 . 1.1 Let {G{ \ i £ I n } be a family of groups and G = G\ x Gb x 
• ■ • x G n . Let be the identity of Gi for all i £ I n . Then (G, *), where * is 
defined above, is a group with e — (ej, e 2 ,..., e n ) the identity el.em.ent, and for 
all (( 2 } , £22 ) ■ ■ • > ^ 7 1 ) £ G , 

(Tl, ^2 ®n) (^i 7 ®2 > ■ • 1 » ) * 
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Furthermore, let 

Hi {^1 ■> ■ i ^i—1 ■> j ^7+1 j • ■ - > ) | € Gi } 

for all i G I n - Then the following assertions hold. 

(i) Hi is a normal subgroup of G for all i G I n . 

(ii) For all a (z G, a can be uniquely expressed as a — h\h 2 • • ■ h n , where 
hi £ Hi, i € In- 

(Hi) H t fl (HiH 2 • ■ • Hi-iHi+i ■ ■ • H n ) — {e} for all i G I n . 

(iv) G — H 1 H 2 ■ • ■ H n . 


Proof. First we note that * is single-valued and if (ai,..., a n ), (b \,..., b n ) 
G G, then (ai,..., a n ) * ( 61 ,..., b n ) = (aibi,... , a n b n ) G G since aibi G Gi for 
all i. Thus, * is a binary operation on G. We ask the reader to verify that * is 
associative. Now e = (ei,e 2 ,..., e n ) G G and for all a = ( 01 , a 2 , ■ ■ ., a n ) G G, 

ae — (cri, 0 . 2 1 • ■ ■ 1 ®n)(ei, e 2 ,..., 6 n) 

(aiei,a 2 e 2 ,-. u. n e n } 

— (&i , a 2 ,..., a n ) 

— a. 

Similarly, ea — a. Hence, e is the identity of G. To show that every element of 
G has an inverse in G, let ( 01 , a 2 ,..., a n ) G G. Then (a^ 1 , alf 1 ,..., a” 1 ) G G 
since a ” 1 G Gi for all i and 

(fll, CL 2 , . . • , , 0<2 , . . . , a n ) , (22^2' ? • • • > Q'n&n ) 

= ( e i> e 2 ,..., e n ) 

— e. 


Similarly, (a ^ 1 ,af l ,..., a“ 1 )(ai, a 2 ,..., a n ) = e. Thus, every element of has 
an inverse. Consequently, (G, *) is a group. We also note that by the uniqueness 
of the inverse of an element 


(( 2 i, ( 2 2 j • • • j ^n) i @2 j • • * > )■ 

(i) Let i € In- Since (ei, e 2 ,... ,e n ) G Hi, Hi ^ <f. Let a = (ei,... ,Oi,... ,e n ), 
6 = (ei,..., 6*,..., e n ) G Hi. Then 


ab 1 = (ej,. 

■ •, ai,..., e n ) (fii >..., bi , 

= (ei,. 

.., (2j,.,., ) (ci,... j 6^ 

- ( e i>- 

.., O'ibi ,... , en) G H7. 


Thus, Hi is a subgroup of G by Theorem 4.1.3. Let g — (gi,g 2 ,... ,g n ) G G. 
Then 


9^9 (^15 92. j • ■ •»^ 77 ) (ci j • • • , O 7 , • • •, 677 )((?i, g2 , . . . , prz) 

~ (#1) 52> • • • J ^7^7, . . • ) i^2 1 • • • } 9n ) 

= (ei, ■ • • ,&Oi0 t rl > ...,e n )eHi since gmgf 1 G Gi. 
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Hence, Hi is a normal subgroup of G. 

(ii) Let a = (aj, < 22 ,..., a n ) E G. Let hi = (ej,..., a*,..., e n ) for all z E 
Then a = h\h 2 • • • h n . To show that the representation of a is unique, let 
a — k\k ,2 ■ • ■ k n be another representation of a, where k{ E Hi for all i E I n . Let 
h = (ei,..., bi,..., e n ) E Hi for all z E I n - Then 

(®1, 0,2 , . . . , H n ) h\ /l2 ‘ ‘ ‘ hn H k\ k,2 ' ■ ' kji (&1, ■)■•'•> bn) • 

This implies that a* = bi for all z E I n and so hi = ki for all i E I n . Hence, the 
representation of a is unique. 

(iii) Suppose a E HN l (Hi • • • Hi-\Hi + \ • • ■ H n ). Then a E Hi and 

CL E H\ ‘ • • Hi — \Hi-\-\ • ■ • Hn- 

Since a E Hi, a = (e \,..., a^,..., e n ) E for some a\ E Gi and since 

a E H\ ■ • • Hi—iH % -|_i • • • Hn-, 

we have a — h\h 2 • ■ • hi-ihi + i ■ • ■ h n , where hj = (ei,..., a,j, ..., e n ) E Hj for 
some aj E G 3 . Thus, 

(ei,..., di,..., c n ) — cl — h\ hi— 1 hi±\ • ■ • h n — (cli ,..., cii —j, Gi, ..., n n )■ 

This implies that for all i E I n - Hence, 

Hi n (HiH 2 ■ • • Hi-iH i+l ■ • • H n ) — {e}. 

(iv) The desired result follows from (ii). ■ 

Definition 6.1.2 The group G of Theorem 6.1.1 is called the external direct 
product of the groups Gi, i = 1,2 ,... ,n. 

Theorem 6.1.1 motivates the following definition. 

Definition 6.1.3 Let G be a, group a,nd {Ni \ i E I n } be a family of normal sub¬ 
groups of G. Then G is called the internal direct product of N\, N 2 ,..., N n 
if every a E G can be uniquely expressed as a = a\a 2 ■ ■ ■ a n , where ai E 1V Z for 
all i E I n - 

Let G = Gi x G 2 x • • ■ x G n be the external direct product of the groups Gi. 
Let Hi be defined as in Theorem 6.1.1. Then G is the internal direct product 
of H\,H 2 , - - -, H n by Theorem 6.1.1(h). 

Theorem 6.1.4 Let G be a group and, {Ni \ i E I n } be a family of norm,a,l 
subgroups of G. Then G is an internal direct product, of {Ni \ i E I n } if and 
only if G = N\N 2 • • • N n and Ni fl (N\ • • ■ Ni-\Ni + i ■ • • N n ) = {e} for all i E I n - 
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Proof. Let G be an internal direct product of {Ni \ i G I n }. Let a G G. 
Then a = aia 2 ■ • • a n for some a z G Ah, i G I n . Thus, a G N 1 N 2 - • • N n and this 
implies that G — AhIV 2 • • ■ N n . We now show that Ni H (N\ ■ ■ • Ah-iAh +1 • • • N n ) 
= {e} for all i G I n . Let i G I n and a G N Z D (Ah ■ ■ ■ Ah-iAh+i • • • N n ). Then 
a G Ah and a G Ah • ■ • Ah-iAh+i • ■ ■ N n . This implies that we can write a ~ 
aia 2 • ■ ■ ai-\a l+ i • • ■ a n for some a 3 G Nj, j G I n \{i}. Hence, 

ee • • • a • ■ • e = a = 0 , 10,2 ■ • • ■ ■ • o n 

are two representations of a, where Oj G A^, j G I n \{i}. Since the representa¬ 
tion of a is unique, a = e. Hence, AhH (Ah • ■ • Ah_i Ah- 1-1 ■ • • N n ) = {e}. 

Conversely, suppose G = Ah Ah • • • N n and AhH (Ah • • • Ah-iAh+i - - • N n ) = 
{e} for all i G I n - Then Ah fl Nj — {e} for all i ^ j and hence uv — vu for all 
ue Ni and for all v G Nj by Exercise 12 (page 137). Let a — aia 2 ■ ■ ■ a n = 
bib 2 * • • b n be two representations of a, where a^, bi G Ah, i G I n . Then 

e = a~ 1 a 

= (aia 2 -- ■a n )~ l (bib 2 -- -b n ) 

= a n a n— 1 ' ' ‘ a l ^ 1^2 ' ' ' b n 

■ (2j bia 2 b 2 ■ ■ ■ o n b n 

since for all i ^ j if u G Ah and v G Nj, then uv = vu. This implies that 

b~ l di = af l b\ ■ ■ ■ a^bi-ia^bi+i • • • a~ l b n G Ah fl N X N 2 - - ■ Ah-iAh+i ■ ■ - N n 

for all i € I n . Since Ah fl Ah Ah • • ■ Ah-i Ah+i ■ • • A^ n = {e}, we must have 
b~ l O{ = e or a t = bi for all i G I n . Thus, a can be written uniquely as 0102 • • ■ a n , 
where Oi G Ah, i G I n - Hence, G is an internal direct product of {Ah | i G I n }. ■ 

In the following theorem, we show that if a group G is an internal direct 
product of a family of normal subgroups {Ah | « G / n }, then G can be viewed 
as an external direct product of the groups Ah’s. 

Theorem 6.1.5 Let G be an internal direct product of a family of norm,a,l 
subgroups {Ah | i G I n }. Then 

G ~ Ah x N 2 x • • ■ x N n . 

Proof. Let a G G. Then a can be expressed uniquely as a — 0102 ■ ■ -a n , 
where a* G Ah, i G I n . Define 

/ : G —> Ni x N 2 x • • • x N n 


by 


/(a.) — (ai , 00 ,..., a nl # —— # 
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for all a E G. From the definition of /, it follows that / is well defined and onto 
x N2 x • • • x N n . And from the uniqueness of the representation of a, it follows 
that / is one-one. We now show that / is a homomorphism. Let a = d\d 2 ■ ■ ■ a n 
and b = b\b 2 ---b n be two elements of G, where di, bi E Ni,i E I n . Now 
N{ fl Nj = {e} for all i ^ j and so uv — vu for all u E Ni, v E Nj. This implies 
that 

ab d^o ,2 ■ ■ ■ d n b\b 2 ' ' ■ bji - d\b\d 2 b 2 ■ ■ ■ dnbn- 

Thus, 

f(db) = (ai&i, 0262) • • ■ > a n 6 n ) 

— (fll, (22 > • • • ) ^n){bl j ^2 1 • • • i bn) 

- f(a)f(b) 

and so / is a homomorphism. Consequently, G ~ N\ x A ^2 x • • ■ x N n . ■ 

Considering Theorem 6.1.5, let 11 s agree to write G = N\ x N 2 x ■ • • x N n 
when G is an internal direct product of a family of normal subgroups {Ni \ 
i E I n } • 


6.1.1 Worked-Out Exercises 

0 Exercise 1 Let G and G\ be groups and / : G —► G\ be a homomorphism. 
Let IT be a normal subgroup of G. Suppose that f\fj ■ H —■* G 1 is an 
isomorphism of H onto G\. Prove that G = Hx Ker /. Give an example 
to show that this result need not be true if H is not a normal subgroup. 

Solution: Let d E G. Then/(a) E G\ = f(H). Thus, there exists h E H 
such that f{d) — f(h). Now /(a) — f{h ) implies that /(/ 2 _1 o) = e\ and 
hence h~ 1 d E Ker /. Therefore, there exists b E Ker / such that b — h~ 1 d 
ot d — hb. Hence, G = iTKer /. Suppose a E HflKer /. Then d E H and 
f{o) = e\ = /(e). Since /\h is one-one, /(a) = /(e) implies that a = e. 
Therefore, ITnKer / = {e}. Thus, H and Ker / are normal subgroups of G 
such that G = iTKer / and iTflKer / = {e}. Consequently, G = Hx Ker /. 

This result need not be true if H is not a normal subgroup of G. For let 
G — S 3 and G 1 = {g') be such that o(</) = 2, i.e., G\ is a cyclic group of order 
2. Let H — ((1 2)). Define / : G —> G\ by /(e) = e, f{x) = e if x is an element 
of order 3, and fix) — g' if x is an element of order 2. Then /\h : H Gi is 
an isomorphism of H onto G\. Now Ker / = {e, (1 2 3), (1 3 2)} = ((1 2 3)) . 
But G ^ Hx Ker / (see Exercise 14, page 188.) 


Exercise 2 Let G be a group and H and K be subgroups of G such that 
G — H x K. Let N be a normal subgroup of G such that N fl H = {e} 
and N D K = {e}. Prove that N is commutative. 
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Solution: Since G = H x K, H and K are normal subgroups of G. Now 
for all n G N, h G H, k G K,nh — hn , and nk — kn by Exercise 12 (page 
137). Let a, 6 G TV. Then there exist h G H,k G K such that b = hk. Now 
a6 = a(hk ) = (ah)k — {ha)k — h{ak ) = h(ka) = {hk)a = 6a. Hence, 77 is 
commutative. 


<0 Exercise 3 Let (7 be a group and A and B be subgroups of (7. If 

(i) <7 = AB, 

(ii) ab = ba for all a G A, 6 G B, and 

(iii) A ft B = {e}, 

prove that (7 is an internal direct product of A and B. 


Solution: Let us first show that A and B are normal subgroups of G. For 
this, let a G A, g G (7. There exist c G A and 6 6 B such that g = c6 by 
(i). Now ga<? -1 = (c6)a(c6) -1 = c6a6 _1 c _1 = ca66~ 1 c~ 1 — cac^ 1 G A. Hence, 
A is a normal subgroup of G. Similarly, B is a normal subgroup of G. Let 
g G G. Then g — ab for some a G A, 6 G B . Suppose g = aib\, where a\ G A, 
6 i G B. Then ab — ai6i, which implies that a^a = b\b~ l G A D B = {e}. 
Thus, a = ai and 6 = b\. Therefore, we find that every element g of G can be 
expressed uniquely as g = a6, a G A, 6 G B. Consequently, G is an internal 
direct product of A, B. 


0 Exercise 4 Let G be a cyclic group of order mn, where m, n are positive 
integers such that gcd(m,n) — 1. Show that G ~ Z m x Z n . 


Solution: Since m divides |(7| and G is cyclic, there exists a unique cyclic 
subgroup A of G of order m by Theorem 4.2.10. Similarly, there exists a unique 
cyclic subgroup B of G of order n. Now |A Pi B\ divides |A| = m and |A D B\ 
divides \B\ = n. Since gcd(m,n) = 1 , \A fl B\ — 1. Thus, by Theorem 4.3.15, 


\AB | = 


|Aj 1-BI 

\Ar\B\ 


mn 


1 


= mn — |C|. 


Since AB C (7, |AB| = |(7|, and G is finite, we must have G = AB. Hence, 
G = AB, A fl B = {e}, and A and B are normal subgroups of G. Thus, 
G = A x B ~ Z m x Z n . 


<0 Exercise 5 Let A and B be two cyclic groups of order m and n, respectively. 
Show that A x B is a cyclic group if and only if gcd(m, n) = 1. 


Solution: Let A = (a) for some a G A and B = (b) for some 6 G B. Suppose 
gcd (m,n) = 1. Let g — (a, 6). Then g mn = (a, b) mn = (a mn , b mn ) = (e^e#), 
where denotes the identity of A and e# denotes the identity of B. Suppose 
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o(g) — t. Then (a, b ) l — (e A , e#). This implies that a 1 — e A and b l = e B . Thus, 
m\t and n\t. Since gcd(m,n) = 1, mn\t. Hence, rrm is the smallest positive 
integer such that g 77171 = e. Thus, o[g) = rrm. Now \A x B\ — mn and A x B 
contains an element g of order mn. As a result, A x B is cyclic. Conversely, 
assume that A x B is cyclic and gcd(m, n) — d ^ 1 . Let (a, 6 ) 6 A x B. Then 
o(a)]m and o( 6 )|n. Now ^ is an integer and ^ < mn. Also, 

(a,b) d =( a d,b ~d)=-- {e A ,e B ) : 

Hence, A x B does not contain any element of order mn. This implies that 
A x B is not cyclic, a contradiction. Therefore, gcd(m, n) — 1 . 

Exercise 6 Show that |Aut(Z 2 x Z 2 ) I — 6 . 

Solution: First note that Z 2 x Z 2 has four elements, e = ([0], [0]), a = 
«1],[0]), b = ([0],[1]), c = ([1], [1]), and o(a) = o(b) = o (c) = 2. Let / € 
Aut(Z 2 x Z 2 ). Then o (f(x)) = o(x) for all x € Z 2 x Z 2 . Hence, / maps {a, b, c} 
onto {a, 6 ,c}. Thus, / is a permutation of {a, 6 , c}. Since there are only six 
permutations of {a, 6 , c}, it follows that |Aut(Z 2 x z 2 )| < 6 . Now a + b — c, 
a + c = 6 , 6 + c = a, and a + a = e = 6 + 6 — c + c. Thus, any permutation of 
{a, 6 , c} gives rise to an automorphism of Z 2 x Z 2 . For example, let a : a —» b, 
b —> c, c —> a, and e —» e. Now a(a + b) — a(c) = a and a(a) + a(b) — 6 + c = a. 
Therefore, a(a + b) = a(a)+o( 6 ). Similarly, a(a + c) = a(a)+a(c), a(b + c) = 
a ( 6 ) + a(c), a(a + a) = a (a) + a(a), a (6 + 6 ) = a(b) + a(b), and a(c + c) = 
a(c) +. a(c). Hence, o is an automorphism. Thus, |Aut(Z 2 x Z 2 )| = 6 . 

6.1.2 Exercises 

1. Prove that the direct product of two groups A and B is commutative if 
and only if both groups A and B are commutative. 

2 . Let A, B , (7, and D be four groups such that A ~ C and B ~ D. Show 
that A x B ~ C x D. 

3. Let G be a group such that G — H\ x H 2 x • • ■ x H n , where Hi is a 
subgroup of G. Let iQ be a normal subgroup of G such that Ki C Hi , 
1 < i < n. Let K — K\ x A 2 x - • - x K n . Show that 

G Hi H 2 H n 

K ~ K\ X K 2 X X K n 

4. Let Gi be a group, 1 < i < n. Show that 

Z(Gi x G 2 x • • • x G n ) = Z(G0 x Z(G 2 ) x • • • x Z(G„). 

PuJlc. 
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5. Let G be a group and H and K be subgroups of G such that G — H x K. 
Show that G/K ~ H and G/H ~ K. 

6 . Let G be a finite cyclic group of order mn, where m and n are relatively 
prime. Let H and K be subgroups of G such that \H\ = m and \K\ = n. 
Show that G = HxK. 

7. Prove that Aut(Z 2 x Z 2 ) — S 3 . 

8 . Let G be a group and H and K be normal subgroups of G such that 
G = HK. Let H C\ K — N. Show that 

G/N ~ H/N x K/N. 

9. Prove that a finite Abelian group G is the internal direct product of 
subgroups H and K if and only if (i ) H C\K = {e} and (ii) |(7| = |if||K|. 

10. Show that the Klein 4-group is isomorphic to the direct product of a 
cyclic group of order 2 with itself. 

11 . Show that a cyclic group of order 4 cannot be expressed as an internal 
direct product of two subgroups of order 2 . 

12. Show that a cyclic group of order 8 cannot be expressed as an internal 
direct product of two subgroups of order 4 and 2, respectively. 

13. Can the cyclic group Z 12 be expressed as an internal direct product of 
two proper subgroups? 

14. Show that S 3 cannot be written as a direct product of proper subgroups. 

15. Show that D 4 cannot be expressed as an internal direct product of two 
proper subgroups. 

16. Consider the groups Z 2 x S 3 , Z 2 x Zq, and Z 12 . Are any two of these 
. groups isomorphic? Is any one noncommutative? 

17. Show that the additive group (Z, +) cannot be expressed as an internal 
direct product of two nontrivial subgroups. 

18. Show that the additive group (Q, +) cannot be expressed as an internal 
direct product of two nontrivial subgroups. 





6.1. EXTERNAL AND INTERNAL DIRECT PRODUCT 


189 


Heinrich Weber (1842-1913) was born on May 5, 1892, in Heidelberg, Ger¬ 
many. In 1860, he studied mathematics and physics at the University of Heidelberg. 
He received his Ph.D. in 1863. He was appointed as extraordinary professor at the 
University of Heidelberg in 1869 and also taught at Edgenossische Polytechnikum in 
Zurich, the University of Konigsberg, the Technische Hochschule in Charlottenburg, 
and the universities of Marburg, Gottingen, and Strasbourg. 

Weber was a friend of Richard Dedekind and they often collaborated. Together 
they edited the work of Riemann in 1876. Herman Minkowski and David Hilbert were 
among Weber’s students. 

Weber’s main research interests were in analysis and its applications to mathemat¬ 
ical physics and number theory. He was encouraged by von Neumann to investigate 
physical problems and by Richelot, to study algebraic functions. Along the lines of 
Jacobi, he worked on the theory of differential equations. He proved Abel’s theorem in 
its most general form. He also worked on physical problems concerning heat, static and 
current electricity, the motion of rigid bodies in liquids, and electrolytic displacement. 

Weber’s most profound and penetrating work is in algebra and number theory. He, 
jointly with Dedekind, did work of fundamental importance on algebraic functions. 

In 1891, Weber gave the “modern” definition of an abstract finite group. One of 
his outstanding accomplishments was the proof of Kronecker’s theorem, which states 
that absolute Abelian fields are cyclotomic. 

Weber was an enthusiastic and inspiring teacher who took great interest in educa¬ 
tional questions. He died on May 17, 1913. 





Chapter 7 

Sylow Theorems 


In general, the converse of Lagrange’s theorem does not hold (Exercise 19, page 
138). In this chapter, we prove the Sylow theorems, which are very helpful 
in determining whether a given finite group has subgroups of specific orders. 
There are several known proofs of the Sylow theorems. In this text, we give two 
different proofs of the Sylow theorems, one based on the notion of group action 
(Section 5.4) and another based on the notion of conjugacy classes (Section 
7.1). In Section 7.4, we will apply the Sylow theorems to determine certain 
simple groups. 

7.1 Conjugacy Classes 

In this section, we define an equivalence relation commonly known as a con¬ 
jugacy relation on a group. This relation partitions the group into disjoint 
equivalence classes, which helps ris to obtain a decomposition of the order of 
a finite group. This particular decomposition of the order of a finite group is 
known as the class equation. The class equation is very useful in determining 
the nature and structure of finite groups. The results obtained in this section 
will be used throughout this chapter. 

Definition 7.1.1 Let G be a group and a be an element of G. Then the cen¬ 
tralizer or normalizer of a in G, denoted by C(a), is the set of all elements 
of G which commute with a, i.e., 

C{a ) = {b £ G | ba = ab}. 

We note that C{a) — G if and only if a is in the center of G. 

Let G be a group and qeG. An element b E G is said to be a conjugate 
of a in G if there exists c € G such that b = cac~ l . 

In the following theorem, we prove some basic properties of the centralizer 
of an element. 
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Theorem 7.1.2 Let a be an element of a group G. Then 

(i) C(a ) is a subgroup of G. 

(ii) The relation p on G defined by 

p ~ {(a,b) e G x G \ b is a conjugate of a} 

is an equivalence relation, known as conjugacy, on (7; the equivalence class 
[a] of the relation p is called, a, conjugacy class of a in G. We denote the 
conjugacy class [a] by Ci(a). 

(in) The number of conjugates of a is equal to the index ofC(a ) in G , i.e., 

|C,(a)| = [G : C(o)]. 

Proof. (i) Since ea = a = ae, e G C(a ) and so (7(a) ^ (f. Let b,c G (7(a). 
Then ab - ba and ac — ca. Also, ac — ca implies that ac -1 — c~ l a. Now 
a(bc~ 1 ) = (ab)c^ 1 = ( ba)c~~ l = b(ac~ l ) = 6(c -1 a) — ( bc~ l )a. Therefore, 6c _1 G 
(7(a). Hence, (7(a) is a subgroup of G by Theorem 4.1.3. 

(ii) Note that for all a € G, a = eae -1 . Thus, for all a G (7, a is a conjugate 
of a. Hence, p is reflexive. For symmetry, let (a, 6) G p. Then there exists 
c G G such that b — cac~ l . This implies that a = c~ 1 bc and so (b, a) G p. 
Hence, p is symmetric. To show that p is transitive, let (a, 6), (6,c) G p. Then 
there exist u,v G G such that b = uau~ l and c = vbv~ l . This implies that 
c = ( vu)a(yu)~ l and so (a, c) G p. Thus, p is transitive. Consequently, p is an 
equivalence relation. 

(iii) Let a G G. Let hi denote the set of all distinct left cosets of (7(a) in G. 
Then \hi\ - [G : <7(a)]. Now bab~ l G C t (a) for all b G G. Define / : hi -► G t (a) 
by f(bC(a)) — bab~ 1 . Let 6, c G G. Now bC(a ) = cC{a ) if and only if c~ l b G 
(7(a), which in turn is equivalent to ( c~ l b)a — a(c“ 1 6). Now (c“ 1 6)a = a(c“ 1 6) 
if and only if bab~ l — cac~ l . Therefore, / is a one-one function. From the 
definition of /, it follows that f maps hi onto Ci(a). Hence, / is a one-one 
function of hi onto (7/(a). Consequently, |C)(a)| — \hi\ — [G : (7(a)). ■ 

Corollary 7.1.3 Let G be a finite group. Then 

|G| = £[G : C(o)], 

a 

where the summ.a.tion is over a complete set of distinct conjugacy class repre¬ 
sentatives. 


Proof. By Theorem 7.1.2(h), G = U a (7/(a), where the union runs over a 
complete set of distinct conjugacy class representatives. The corollary follows 
since the distinct conjugacy classes are mutually disjoint and \Ci(a)\ — [(7 : 
(7(a)] for all a G G by Theorem 7.1.2(iii). ■ 





7.1. CONJUGACY CLASSES 


192 


Corollary 7.1.4 Let G be a finite group. Then 

\G\ = \Z(G)\+ £ [G:C(a)j, (7.1) 

a(Z(G) 

where Z(G ) denotes the center of G and the summation runs over a. complete 
set (possibly empty) of distinct conjugacy class representatives, which do not 
belong to Z(G). 


Proof. First observe that a E Z{G) if and only if C{a) = G if and only if 
[G : C(a)\ — 1. By Corollary 7.1.3 

|G| = £[G : G(o)], 

a 

where the summation is over a complete set of distinct conjugacy class repre¬ 
sentatives. This implies that 

|G|= £ [G:C(a)]+ £ [G: C(a)|. 

a £Z(G) a<£Z(G) 

Since a € Z[G) if and only if [G : C(a)] = 1 , it follows that J2 a £Z(G)[G '• 
C(a)\ = \Z(G)\. Hence, ’ ’ 

\G\ = \Z(G)\+ £ [G : G(a)j, 

a <£Z(G) 

where the summation runs over a complete set (possibly empty) of distinct 
conjugacy class representatives which do not belong to Z(G). ■ 


Eq. (7.1) in Corollary 7.1.4 is called the (conjugacy) class equation. 


Example 7.1.5 Consider S 3 . By Worked-Out Exercise 1 (page 94), it follows 
that S 3 has three conjugacy classes, namely, 


1 2 3 
1 2 3 


1 2 3 

2 3 1 


1 2 3 
3 1 2 


, and 


1 2 3 

2 1 3 


1 2 3 
1 3 2 


12 3 
3 2 1 


The class equation reads 


1*1 - \Z(G)\ + [S 3 : C(( 2 3 J)1 + [* = ^((2 1 3 


)] 


6 = 


+ 


+ 3. 
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Example 7.1.6 Consider the group of symmetries of the square. The distinct 
conjugacy classes are {ri 80 }, {r 360 }, {r 9 o,r 27 o}, {d\,d 2 }, {h,v}. The class equa¬ 
tion reads 

\G\ = \Z(G)\ 4- [G:C(r 90 )} + [G:C(d 1 )] + [G : C{h)] 

8 = (1 + 1 ) + 2 + 2 + 2 . 

Until now our discussion focused on the conjugacy class of an element of a 
group. We now extend our discussion to the conjugate subgroup of a group. 
We will be mainly interested in determining the number of distinct conjugates 
of a subgroup induced by the elements of another subgroup. We begin with 
the following theorem. 

Theorem 7.1.7 Let H be a subgroup of a group G and a £ G. Then aHa~ l 
is a subgroup of G, called a conjugate of H. Furthermore, H — aHa~ l . 

Proof. By Worked-Out Exercise l(i) (page 106), aHa~ 1 is a subgroup of G. 
Now define / : H —*■ aHa~ l by f(h) — aha~ l for all h £ H. As in Worked-Out 
Exercise 1 (ii) (page 106), f is a one-one function from H onto aHa~ l . To show 
that / is a homomorphism, let hi, h 2 € H. Then /(hih 2 ) = a(hih 2 )a _1 = 
(ahio~ 1 )(ah 2 a“ 1 ) = /(hi)/(h 2 ). Hence, H ~ aHa~ l . ■ 

Definition 7.1.8 Let H be a subgroup of a group G and a £ G. If aHa^ 1 — H , 
then H is called invariant under a. 

Definition 7.1.9 Let H and K be subgroups of a group G. Let Nk{H) denote 
the set 

N k (H ) = {k £ K | kHk~ l = H }. 

Nk(H) is called the normalizer of H in K. 

It follows that Nk{H) — Ng(H ) n K. 

Theorem 7.1.10 Let H and K be subgroups of a group G. Then Nk(H ) is a 
subgroup of K. 

Proof. Since e £ K and eHe^ 1 — H, e £ Nk(H) and so Nk(H) ^ <b. Let 
k\, fc 2 £ Nk{H). Then k\Hkf l — H = Now H = implies 

that H = kf 1 Hk 2 . Thus, 


H = k Y Hkf l 

= k\{kf l Hk2)k^ 1 
= (k\kf l )H(k\kf l )~ 1 . 

~PilLlc- TfeoiAewtotLea/ 
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Hence, k\k 2 l E Nk(H). Thus, N^(H) is a subgroup of G. ■ 

When K — G, we write N(H) for Nq{H ) and refer to the subgroup N(H ) 
simply as the normalizer of H. By Exercise 11 (page 137), N(H ) is the largest ■ 
subgroup of G in which H is normal. Of course N{H) = G when H is a normal 
subgroup of G or when G is commutative. 


Example 7.1.11 

group 


Consider the symmetric group S 3 . In Example 4.3.2, the sub- 


H' = 


1 2 3 
1 2 3 



is not a normal subgroup of S 3 . We note that N(H') = H'. 


Theorem 7.1.12 Let H a.nd K be subgroups of a group G. The number of 
distinct conjugates of H induced, by the elem.en.ts of K is equal to [K : Nk(H)], 
the index of Nk(H) in K. 


Proof. Let T be the set of distinct conjugates of H induced by the elements 
of K , i.e., T = {kHk~ l \ k E K} and let S be the set of distinct left cosets 
of Nk(H) in K, i.e., S = {aNx(H) ) a E K}. To show that the number of 
distinct conjugates of H induced by the elements of K is equal to [K : Nx(H)\, 
the index of Nx(H) in K , we need to show that there exists a one-one function 
of T onto S. 

Define f : T —> S by f(aHa~ l ) = aNx(H) for all aHa~ l £ T. Let k \, 
&2 E K. Then k\Hkf l = if and only if H — (kf 1 k 2 )H(kf 1 k 2 )~ l . Now 

H — (kf l k 2 )H(kf l k 2 )~ l if and only if kf 1 k 2 E Nx(H) and the latter is true 
if and only if kiNx(H) — (i 7 ). Thus, we have shown that / is a one-one 

function. From the definition of /, it is immediate that / is onto S. Hence, the 
number of distinct conjugate subgroups of H by the elements of K is equal to 
the number of distinct cosets of Nx(H) in K. ■ 

Corollary 7.1.13 Let H and K be finite subgroups of a group G. If H is 
invariant under n elements of K, then H has \K\/n conjugates by elements of 

K. 


Proof. By hypothesis, \Nx(H)\ = n. Hence, \K\ = [K : Nk{H)] ■ \Nk{H )| 
by Lagrange’s theorem. The corollary is now immediate by Theorem 7.1.12. ■ 


7.1.1 Worked-Out Exercises 

0 Exercise 1 Let G be a finite group and a E G be such that a has only two 
conjugates. Prove that C(a) is a normal subgroup of G. 

“Public. Ma.£4a#>ta£lea./ 
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Solution: By Theorem 7.1.2, [C7 : C(a)} = |CT/(cx)| . Now |C/(a)| = 2. 
Hence, [G : (7(a)] = 2, proving that (7(a) is a normal subgroup of G. 

0 Exercise 2 Let G be a finite group that has only two conjugate classes. 
Show that j(7| = 2. 

Solution: Let |(7| = n. Let a G G and a ^ e. Then G = Ci(e) U (7/(a). 
Since |C/(e)| = 1, \Ci(a)\ = n — 1. Hence, n — 1 = |C;(a)| = [G : C(a )] divides 
|G| = n. This is possible only if n = 2. 

Exercise 3 Prove that there exists no finite nontrivial group every nonidentity 
element of which commutes with exactly half the elements of the group. 

Solution: Let G be a group of order n > 1 such that every nonidentity 
element of G commutes with exactly half the elements of G. Let a G G and 
a ^ e. Then |(7(a)| = nf 2. Hence, |(7/(a)| (= [G : C(a)\ = 2. Now |G| = 
\Cl( e ) l + X] a ^e |C/( a )l , where the summation runs over a complete set of distinct 
conjugacy class representatives. Since \Ci(e)\ — 1 and \Ci(a)\ = 2 for all 
e / a £ G, we find that |G| is odd. But |C(a)| = ^ ^ shows that |G| is 

even. This contradiction shows that there cannot exist any group of this type. 

7.1.2 Exercises 

1. Let G be a group and a E G. Prove that a G Z(G) if and only if Ci(a) = 
{a}. 

2. Let G be a finite group. Prove that if there exists an element a G G with 
exactly two conjugates, then G contains a nontrivial normal subgroup. 

3. Prove that a subgroup if of a group G is a normal subgroup if and only 
if H is the union of conjugacy classes of G. 

4. Let G be a group, H a subgroup of G, and a G G. Prove that N{aHa ~ l ) = 
aN(H)a~ l . 

5. Let H and K be subgroups of a group G. Prove that H is normal in K 
if and only if H C K C Ng{H). 

6. Let G be a group and H and K be subgroups of G. Prove that if H and 
K are conjugates, then Nq{H) and Nq{K ) are conjugates. 

7. Find the class equation for S$. 
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7.2 Cauchy’s Theorem and p-groups 

In this section, we prove an important theorem which gives a partial converse of 
Lagrange’s theorem. This interesting theorem is due to Cauchy. First we will 
prove this theorem for finite Abelian groups and then with the help of the class 
equation extend it to any finite group. The proof of Cauchy’s theorem given in 
this book is intended to show the reader the use of the ideas of quotient groups 
and the class equation. With the help of Cauchy’s theorem, we also prove that 
the converse of Lagrange’s theorem holds for finite Abelian groups. 

Lemma 7.2.1 If G is a finite commutative group of order n such that n is di¬ 
visible by a prim.e p, then G contains an element of order p (whence a subgroup 
of order p). 

Proof. The proof is by induction on the order of G. If \G\ = p, a prime, then 
every element of G, other than the identity, has order p. Thus, in particular, 
the lemma is true when |G| = 2. Now make the induction hypothesis that the 
lemma is true for all groups of order r, where 2 < r < n. Suppose G is a group 
of order n. Let a £ G with a e and let m, denote the order of a. Then either 
p\m or p/m. If p|m, then m = pk for some positive integer k. In this case, 
( a k ) p — a m = e, from which it follows that a k e and a k is an element of order 
p. Now suppose p/m. Since G is commutative, the cyclic subgroup H — (a) of 
G is of course a normal subgroup of G. Now \G\ = m, • [G : H). Since p does 
not divide m, we have p|[G : H]. Hence, p divides \G/H\ . Since \G/H\ < n, 
we have by the induction hypothesis that there exists bH £ G/H such that 
o (bH) = p. Now bPH = ( bH)* = H. Hence, IP £ H. Thus, (6 m )P = ( bP) m = e, 
so that either 6 771 = e or b m has order p. But b m e else ( bH) m — H yielding 
p|m, a contradiction. Thus, b m has order p and so b m is the desired element of 

G. m 

Theorem 7.2.2 (Cauchy) Let G be a finite group of order n such that n is 
divisible by a prim,e p. Then G contains an element of order p and hence a 
subgroup of order p. 

Proof. The proof is by induction on' n. If n = 2, then G is commutative 
and the result follows by Lemma 7.2.1. Make the induction hypothesis that 
the result is true for all groups of order m such that 2 < m < n. Consider the 
class equation 

\G\ = \Z(G)\+ Y. [C:C(a)] 

a$Z(G) 

for G. If G = Z(G), then G is commutative and the result follows by Lemma 
7.2.1. If (?/ Z(G), then there exists a £ G such that a Z(G). For such an 
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element a, G ^ C(a) and so [G : C'(a)] > 1, whence by Lagrange’s theorem 

\G\ = [G:C(a)]-\C{a)\>\C(a)\. 

If p divides |C(a)| , then by the induction hypothesis, C(a ) and thus G has 
an element of order p. If p does not divide | C(a)\ for all a (f Z{G), then p must 
divide [G : C(a )] for all a £ Z{G). But in the class equation, p divides each term 
of the summation and also divides |(7|. Thus, p divides |Z(G)| . Since Z{G) is 
commutative, we have again by Lemma 7.2.1 that there exists a € Z(G) and 
hence a 6 G of order p. ■ 

Next, we apply Cauchy’s theorem to prove that the converse of Lagrange’s 
theorem holds for finite commutative groups. 

Theorem 7.2.3 Let G be a finite commutative group of order n. If m is a, 
positive integer such that m|n, then G has a subgroup of order m. 

Proof. If m = 1, then {e} is the required subgroup of order m. If n = 1, 
then m — n — 1 and the result follows easily. We now assume that m > 1, 
n > 1 and prove the result by induction on n. If n = 2, then m = 2 = n and 
G is the required subgroup of order m. Suppose the theorem is true for all 
finite commutative groups of order k such that 2 < k < n. Let p be a prime 
integer such that p\m.. Then there exists an integer m,\ such that m = pm.\. By 
Cauchy’s theorem, G has a subgroup H of order p. Since G is commutative, H 
is normal and hence G/H is a group. Now 

1 < \G/H\ = M < |G| 

and \G/H\ = Now n = mm 2 for some positive integer m 2 . Thus, \G/H\ — 
pm^m 2 _ mirri2 shows that mi divides \G/H\. Hence, from the induction hy¬ 
pothesis, G/H has a subgroup K/H such that \K/H\ — mi, where if is a 
subgroup of G. Now \K\ — \K/H\ \H\ = mip — m. Hence, if is a subgroup of 
G of order m. ■ 


We now apply Cauchy’s theorem to obtain some interesting properties of 
p- groups. 

Definition 7.2.4 Let p be a prime. A group G is said to be a p -group if the 
order of each element of G is a power of p. A subgroup H of a group G is called 
a pi-subgroup if H is a p-group. 


Example 7.2.5 The group of symmetries of a square and the Klein 4-group 
are p-groups, where p — 2. In fact, any group of order p n (p a prim,e) is a 
p-group since the order of each element must divide the order of the group. 

~Pufuc- TfcafAewtafLca/ 
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The following theorem, gives a necessary and sufficient condition for a finite 
group to be a p-group. 

Theorem 7.2.6 Let G be a nontrivial group. Then G is a finite p-group if 
and only if |G| = p k for some positive integer k. 

Proof. Suppose G is a finite p-group. If q divides |(7| for some prime q 7 ^ p, 
then by Cauchy’s theorem G has an element of order q , contradicting the fact 
that G is a p-group. Thus, p is the only prime divisor of \G\ . Hence, |(7| = p k 
for some positive integer k. Conversely, suppose |(7| = p k . Then by Lagrange’s 
theorem, the order of each element of G is a power of p. ■ 

In the next theorem, we prove that the center of a p-group is nontrivial. 

Theorem 7.2.7 If G is a finite p-group with |(7| > 1, then Z(G ), the center 
of (7, has m.ore than one elem.ent, i.e., if |(7| = p k with k > 1, then \Z{G)\ > 1. 


Proof. Consider the class equation 

|G| = |Z(G)|+ £ [< G:C(a)\. 

a<£Z(G) 

If G = Z{G), then the theorem is immediate. Suppose G D Z(G) and 
consider a € G such that a ^ Z(G). Then C(a ) is a proper subgroup of G so 
that by Theorem 7.2.6 and by the fact that (7(a) is a subgroup of a p-group, 
p\[G : C(a)] for all a Z(G). This implies that p divides Yl a £Z(G)\G • (7(a)]. 
Since p also divides |(7|, p divides \Z(G) \. Hence, \Z(G)\ > 1. ■ 

Corollary 7.2.8 Let G be a group of order p 2 , where p is a prime. Then G is 
commutative. 

Proof. By Theorem 7.2.7, |Z(<7)| > 1. By Lagrange’s theorem, \Z(G)\ 
divides p 2 . Hence, |Z(<7)| = p or p 2 . Suppose |Z(G)| = p. Then Z(G) 7 ^ G 
and so there exists a £ G such that a ^ Z{G). Now (7(a) is a subgroup of 
G and a € (7(a). Hence, Z(G) C (7(a). This implies that | (7(a) | = p 2 and so 
G = (7(a). However, this shows that a 6 Z{G), a contradiction. Therefore, 
\Z(G) | = p 2 and so G = Z(G). Thus, G is commutative. ■ 

7.2.1 Worked-Out Exercises 

0 Exercise 1 Show that every group of order pg, where p and q are primes, 
is not simple. 
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Solution: If p = q, then G is a group of order p 2 . Hence, G is commutative. 
Also, Cauchy’s theorem implies that G has a subgroup of order p, which must 
be normal. Therefore, G is not simple. Suppose now p ^ q. We may assume 
that p > q. By Exercise 8 (page 200), G has a normal subgroup of order p. 
Thus, G is not simple. 

Exercise 2 Let H and K be subgroups of a commutative group G. Suppose 
\H\ = m and \K\ = n. Let d — lcm (m,n). Show that G has a subgroup 
of order d. 


Solution: Since G is commutative, HK is a subgroup of G , and since 
H and K are finite, HK is finite. Now H and K are subgroups of HK. 
Hence, m\ \HK\ and n\ \HK\ . This implies that d\ \HK\ . Since HK is a finite 
commutative group and d\ \HK\ , HK has a subgroup of order d and so G has 
a subgroup of order d. 

0 Exercise 3 Let Gbea noncommutative group of order p 3 , p a prime. Prove 
that \Z(G) \ — p. 

Solution: Write Z — Z(G). Since \G\ = p 3 , \Z\ > 1 by Theorem 7.2.7. 
Thus, \Z\ = p, p 2 or p 3 . If \Z\ = p 3 , then G — Z and so G is commutative, 
which is a contradiction. If \Z\ = p 2 , then \G/Z\ = p. Hence, G/Z is cyclic. 
But then G is commutative, again a contradiction. Thus, \Z\ = p. 

Exercise 4 Let G be a finite commutative group. Prove that the number of 
solutions of x n — e in G, where n > 0 and n divides \G\ , is a multiple of 
n. 


Solution: Let H = {x | x £ G, x n = e}. Then H is a subgroup of G. Since 
n divides |G| and G is commutative, there exists a subgroup K of G such that 
\K\ = n. Let a 6 K. Then a n = e. Hence, K C H. By Lagrange’s theorem, \K\ 
divides \H\ . Thus, \H\ — nm,. Consequently, the number of solutions of x n = e 
is a multiple of n. 


<0 Exercise 5 Let G be a group of order p n , pa prime, and n £ Z, n > 1. 
Prove that any subgroup of G of order p n ~ l is normal in G. 


Solution: We will prove the result by induction on n. If n = 1, then 
G is a cyclic group of prime order and hence every subgroup of G is normal 
in G. Thus, the result is true if n = 1. Suppose the result is true for all 
groups of order p m , where 1 < m, < n. Let H be a subgroup of order p n ~ l . 
Consider N(H). If H ^ N(H ), then \N{H) \ > p n -\ Thus, \N(H)\ = p n and 
so N(H) = G. Hence, in this case H is normal in G. Suppose H — N(H). 
Then Z(G), the center of G , is a subset of H and Z{G) ^ {e}. By Cauchy’s 
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theorem and Theorem 7.2.7, there exists a G Z(G) such that o(a) = p. Let 
K = (a) . Then K is a normal subgroup of G of order p. Now \H/K\ = p n ~ 2 
and \G/K\ = p n_1 . Thus, by the induction hypothesis, H/K is a normal 
subgroup of G/K. Hence, H is a normal subgroup of G. 

7.2.2 Exercises 

1. Show that every group of order 14 contains only one normal subgroup of 
order 7. 

2. How many elements of order 7 are there in a group of order 28? 

3. Show that a group of order 15 is commutative. 

4. Let G be a group of order p n , where p is a prime and n is a positive 
integer. Show that G contains a subgroup of order p 1 , 0 < i < n. 

5. Find all 2-subgroups and 3-subgroups of (Zi2,+i2)- 

6. Find all 2-subgroups of A±. 

7. Show that every commutative group of order 36 contains an element of 
order 6. 

8. Let G be a group of order pn, where p is a prime and p > n. Show that 
G contains a normal subgroup of order p. 

9. Let G be a commutative group of order pq, where p and q are distinct 
primes. Show that G is cyclic. Is this result true when p — g? 

10. For any prime p, prove that any group of order p 2 is either cyclic or a 
direct product of cyclic groups. 

11. Show that every group of order 28 with a unique subgroup of order 4 is 
commutative. 

12. Show that a group of order 81 contains a nontrivial normal subgroup with 
more than three elements. 

13. Let G be a group of order 99. Prove the following. 

(i) G has a unique normal subgroup H of order 11. 

(ii) H C Z(G). 

(iii) G has an element of order 33. 
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7.3 Sylow Theorems 

M.L. Sylow did work of fundamental importance in determining the structure 
of finite groups. We can use his results to answer the problem now posed. 

If G is a finite group of order n and if if is a subgroup of G , then we know 
by Lagrange’s theorem that the order of H divides n. In this section, we give 
some answers to the question, “If m is a positive integer, which divides n, does 
G contain a subgroup of order mV 

It is interesting to note that Sylow’s theorem was proved by Sylow for 
permutation groups. George Frobenius established the theorem in the general 
setting. He was influenced to do so by Cayley’s theorem. 

Theorem 7.3.1 (Sylow’s First Theorem) Let G be a finite group of order 
p r m, where p is a prim,e, r a,nd m, are positive integers, and p and m. are 
relatively prim,e. Then G has ai subgroup of order p k for all k, 0 < k < r. 

Proof. First Proof of Sylow’s First Theorem: Let |(7| = n = p T m. 
We prove the result by induction on n. If n — 1, then r = 0 and {e} is the 
required subgroup of order p r . Suppose the result is true for all groups T of 
order less than |G| . If r = 0, then {e} is the required subgroup of order p T . 
We now assume that r > 1. First suppose p divides \Z(G )\, where Z(G ) is the 
center of G. Since p divides \Z(G) \, there exists a E Z(G) such that o(a) = p 
by Cauchy’s theorem. Let H — (a). Then H is a normal subgroup of G since 
a E Z(G). Now \G/H\ — p T ~ l m„ Hence, by the induction hypothesis, G/H has 
subgroups Ki/H of order p 1 for all i = 0,2,..., r —1. Then {e}, H, K\, ..., Lf r - 1 
are the subgroups of G of the required order. 

Now suppose p / | Z(G) \ . Consider the class equation, 

\G\ = \Z(G)\+ £ [<?:C(a)], 

afZ(G) 

where the summation runs over a complete set (possibly empty) of distinct 
conjugacy class representatives which do not belong to Z(G). From the hy¬ 
pothesis, p divides |G|. If p\[G : C(a)] for all a £ Z(G ), then from the class 
equation, it follows that p divides \Z(G) \ , a contradiction to our assumption. 
Hence, there exists a 0 Z(G) such that p does not divide [G : C'(u)]. Now 

|G| = [G:C(a)].|C(a)j. 

This implies that p T divides |C(a)|. Since a 6 C(a), |C(a)| > 1 , Also, C(a) 
G since a £ Z(G). Hence, |(T(a,) | < |G| . Thus, by the induction hypothesis, 
C(a ) has a subgroup of order p 1 for all z, 0 < i < r. Hence, G has a subgroup 
of order p 1 for all i, 0 < i < r. I 
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Second Proof of Sylow’s First Theorem: If r = 0, then {e} is the 

required subgroup of order p r . Suppose r > 1. Since p\ |G|, G has a subgroup 
of order p by Cauchy’s theorem. We now show that if G has a subgroup 
of order p 1 , then G has a subgroup order p 2+1 , where 1 < z < r. Suppose 
G has a subgroup H of order p l , 1 < i < r. Then H is a proper subgroup 
of G. By Worked-Out Exercise 3 (page 176), [N(H) : H] = p [G : H] and 
H 7 ^ N(H). Since p\[G : H], it follows that p\[N(H) : H], i.e., p\ \N{H)/H\. 
Thus, N{H)/H has a subgroup K/H of order p by Cauchy’s theorem. Now 
\K\ = \K/H\ \H\ = pp l — p l+1 . Therefore, K is a subgroup of G of order p l+1 . 
The result now follows by induction. ■ 


The following corollary is immediate from Theorem 7.3.1 

Corollary 7.3.2 Let G be a finite group and p a prime. If p n divides \G\ , 
then G has a subgroup of order p n . ■ 


Definition 7.3.3 Let G be a finite group and p a prim.e. A subgroup P of G 
is called a Sylow p-subgroup of G, if P is a p-subgroup and is not properly 
contained in any other p-subgroup of G , i.e., P is a m,axima.l p-subgroup of G. 

Example 7.3.4 The symmetric group S 3 has three Sylow 2-subgroups, namely 


Hi 


123 ^/ 123^1 
1 2 3 y ’ \^ 2 1 3 y j ’ 


and 


H 2 


123\/l23\l 
1 2 3 y 5 y 3 2 l))' 


H 3 


1 2 3\/l 2 3\| 

1 2 3 y ’ y 1 3 2 y j * 


Thus, a Sylow p-subgroup of a given group need not be unique. 


The following theorem shows the existence of Sylow p-subgroups in a finite 
group. 


Theorem 7.3.5 For each prime p , a finite group G has a Sylow p-subgroup. 


Proof. If |G| = 1 or p does not divide \G\ , then {e} is the required Sylow 
p-subgroup of G. If p divides |G|, then by Cauchy’s theorem, there is at le >st 
one subgroup H of G of order p. Since G is finite, there are a finite number 
of subgroups of G, which contain H. Hence, one of these subgroups is a Sylow 
p-subgroup of G. ■ 
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From Sylow’s first theorem, every group of order p r m (p a prime, gcd(p, m) = 
1) contains a subgroup of order p r . We now show that every subgroup of order 
p T is a Sylow p-subgroup in G and every Sylow p-subgroup of G is of order p T . 

Theorem 7.3.6 Let G be a finite group of order p r m , where p is a prime, r 
and m are positive integers, and p and m are relatively prime. 

(i) Let H be a subgroup of G of order p z , 1 < i < r. Then there exists a, 
subgroup K of G such that \K\ = p t+1 and H is a norm.al subgroup of K. 

(ii) Let H be a subgroup of G. Then H is a Sylow p-subgroup of G if and 
only if |Jf | = p T . 

Proof. (i) By Worked-Out Exercise 3 (page 176), [. N(H ) : H] ~ p [G : H]. 
Since p\[G : if], pj \N(H)/H\. Thus, N(H)/H has a subgroup K/H of order 
p by Cauchy’s theorem. Now \K\ — \H\ \K/H\ — p l+1 . Since H is normal in 
N(H ) and K C N(H ), H is normal in K. Hence, K is the desired subgroup of 
G. 

(ii) Suppose if is a Sylow p-subgroup. Then H is a p-subgroup of G and 
so |if| — p k for some positive integer k. Suppose k ^ r. By (i), there exists a 
subgroup K of G such that H C K and \K\ = p k+1 . This implies that H is not 
a maximal p-subgroup of G , a contradiction. Thus, k — r. Conversely, suppose 
that |if | = p r . Since |G| = p T m and p and m are relatively prime, it follows 
that ii is a maximal p-subgroup of G. Hence, H is a Sylow p-subgroup of G. ■ 

Theorem 7.3.7 Let G be a finite group of order p r m , where p is a prim.e, 
r and m are positive integers, and p and m are relatively prime, and P be a 
subgroup of G. 

(i) If P is ap-group, then any conjugate of P is ap-group. 

(ii) If P is a Sylow p-subgroup, then any conjugate of P is a Sylow p- 
subgroup. 

(Hi) If P is the only Sylow p-subgroup of G, then P is a norm.al subgroup 
of G. 


Proof. (i) Since \P\ = |aPa _1 | and aPa~ l is a subgroup of G , the desired 
result follows from Theorem 7.2.6. 

(ii) Let P be a Sylow p-subgroup. Then |P| = p r . This implies that 
|aPa -1 | = p T and so by Theorem 7.3.6(h), aPa~ l is a Sylow p-subgroup. 

(iii) Let a E G. Then aPa~ l is a Sylow p-subgroup of G by (ii). Since P is 
the only Sylow p-subgroup of G, aPa~ l = P. Hence, P is a normal subgroup 
of G.m 


Lemma 7.3.8 Let H be a norm.al subgroup of a group G. If H and G/H are 
both p-groups, then G is a p-group. 
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Proof. Let a £ G. Then aH £ G/H and so aH has order some power of p, 

i k k 

say, p k . Thus, ( aH) p ~ H and so a p £ H. Now every element of H has order 

k kw 771 mA-k 

a power of p. Let us say a p has order p m . Thus, (aP > = e or aP = e. 

This implies that o(a) has order some power of p. Since a was arbitrary in G, 
G is a p-group. ■ 

Lemma 7.3.9 Let G be a finite group. Let P be a Sylow p-subgroup of G and 
a £ G be such that the order of a is a. power of p. If aPa~ l = P, then a £ P. 


Proof. Since aPa~ l = P, a £ N(P). Now N(P) D P, so if we show that 
no element of N(P)\P has order a power of p, then a £ P. Suppose there 
exists b £ 1V(P)\P such that the order of 6 is a power of p. Now P is a normal 
subgroup of iV(P) so that we may consider the quotient group N(P)/P and 
the coset bP. The order of bP as an element of N(P)/P divides the order of b. 
Hence, bP has order a power of p in N(P)/P. Thus, the cyclic subgroup { bP) 
of N((P)/P has order a power of p and thus is a p-group. By Corollary 5.2.12, 
there is a subgroup K of N(P) such that K D P and K/P = ( bP) . Since 
b P, K D P. By Lemma 7.3.8, K is a p-group since both P and (bP) are 
p-groups. However, this contradicts the fact that P is a maximal p-subgroup 
of G. Hence, no element of N(P)\P can have order a power of p. ■ 

We now prove two more theorems due to Sylow. 

Theorem 7.3.10 (Sylow’s Second Theorem) Let G be a finite group of 
order p r m , where p is a prime, r and m are positive integers, and p and m 
are relatively prime. Then any two Sylow p-subgroups of G are conjugate, and 
therefore isom,orphic. 


Proof. First Proof of Sylow’s Second Theorem: By Theorem 7.3.5, 
G has a Sylow p-subgroup, say, P. Let S be the set of all conjugates of P. We 
show that S contains all Sylow p-subgroups. Let if be a Sylow p-subgroup 
of G such that H ^ S and let Q £ S. Now Q is a Sylow p-subgroup of G 
and | Q\ = p T . Since Q H , it follows that Q % H. Thus, there exists h £ H 
such that h ^ Q. Now o(h) = p k for some positive integer k. By Lemma 7.3.9, 
hQh ~ 1 Q. Thus, the number of conjugates of Q induced by the elements 
of H is more than 1. Hence, by Theorem 7.1.12, [H : N^(Q)] > 1. Now 
p r — \H\ — [H : Nh{Q)\ \Nh(Q)\ and so [ H : Nh(Q )j is a positive multiple of 


p. 

Let us now define a relation p on S by p = {(T, B) £ S X S \ A = hBh~ l 
for some h £ H}. Then p is an equivalence relation on S and for all A £ S, the 
equivalence class, [A], consists of all conjugates of A induced by the elements 
of H. Thus, as shown before, |[A]| is a nonnegative multiple of p. Since S is 
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a disjoint union of such equivalence classes, it follows that |<S| is a positive 
multiple of p and so p\ |<S|. By Theorem 7.1.12, \S\ — [G : N(P)\. Thus, 

m = [G : P] = [G : N(P)][N(P ) : P] = |5| [N(P) : P}. 

From this, it follows that p\m, a contradiction. Hence, S is the set of all Sylow 
p-subgroups of G. ■ 

Second Proof of Sylow’s Second Theorem: Let H and K be Sylow p- 
subgroups of G and S be the set of all left cosets of H in G. Then |5j = [G : H]. 
Let K act on S by for all k E K, aH E S, 

k(aH) = (ka)H. 

Then S is a K- set. Let So = {aH E <5 | k{aH) = aH for all k E K}. By 
Worked-Out Exercise 1 (page 176), 


l<So|= P |S|. 


Since H is a Sylow p-subgroup of G, |<S| — [G : H] is not divisible by p. Thus, 
|So| 7 *“ 0. Let aH E (So- Then k{aH ) = aH for all k E K. From this, it follows 
that a~ l kaH = H for all k E K and so a~ l ka E H for all k E K. Therefore, 
a~ l Ka C H. Since \a~ l Ka\ = \K\ = \H \, a~ 1 Ka — H. Hence, H and K are 
conjugate. ■ 

The following corollary is an immediate consequence of Sylow’s second the¬ 
orem. 


Corollary 7.3.11 Let G be a finite group and H be a Sylow p-subgroup of 
G. Then H is a unique Sylow p-subgroup of G if and only if H is a norm.al 
subgroup of G. M 


Theorem 7.3.12 (Sylow’s Third Theorem) Let G be a finite group of or¬ 
der p r m , where p is a prim,e, r and m are positive integers, and p a,nd m are 
relatively prim,e. Then the number n p of Sylow p-subgroups of G is 1 + kp for 
som,e nonnegative integer k and n p \p r m. 


Proof. First Proof of Sylow’s Third Theorem: Let S be the set 

of all Sylow p-subgroups of G and P E S. Define a relation p on S by 
p = {(A,B) E S x S j A = aBa~ l for some a E P). Then as in the first 
proof of Sylow’s second theorem, p is an equivalence relation on iS and for 
all A E <S, A P, the number of elements in the equivalence class, [A], is a 
multiple of p. Now [P] = {A E S \ A = xPx~ l for some x E P} — {P}- Thus, 
|[P]| = 1. Consequently, |»S| = 1 + kp for some nonnegative integer k. Now by 
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Theorem 7.1.12, n p = |<S| = [G : N(P)]. This implies that n p divides \G \■ 

Second Proof of Sylow’s Third Theorem: Let S be the set of all Sylow 
p-subgroups of G and P £ S. Let P act on S by conjugation, i.e., for all a £ P, 
Q £ S, a ■ Q = aQa~ l . Let Sq = {Q £ S ] a ■ Q = Q for all a £ P} = {Q £ S \ 
aQa _1 = Q for all a £ P}. By Worked-Out Exercise 1 (page 176), 



Since P £ So, So ^ 4>- Let Q £ So. Then Q = aQa -1 for all a £ P. Hence, 
P Q IV(Q) and so P and Q are Sylow p-subgroups of N(Q ) since P and Q 
are Sylow p-subgroups of G. Thus, by Sylow’s second theorem, aQa~ l = P for 
some a £ N(Q). But then P = Q. Thus, So — {P} and so |«So| = 1. Hence, 
|«S| = p 1 and so |5| = 1 + kp for some integer k. 

Let G act on S by conjugation. By Sylow’s second theorem, any two Sylow 
p-subgroups are conjugate. Therefore, there is only one orbit of S under G. 
Let PeS. Then G P = {g e G \ g ■ P = P} = {g £ G \ gPg^ 1 = P} = N{P). 
Thus, by Lemma 5.4.6, 

|<Sj = number of elements in the orbit of P = [G : Gp\. 

But [G : Gp] divides |G| . Consequently, the number of Sylow p-subgroups of 
G divides |G| . ■ 


7.3.1 Worked-Out Exercises 

<(> Exercise 1 Show that every group of order 45 has a normal subgroup of 
order 9. 


Solution: Let G be a group of order 45 = 3 2 ■ 5 and 713 denote the number 
of Sylow 3-subgroups of G. Then 77,3 = 3fc+ 1 for some integer k > 0 and 723 |4h. 
If k = 0 , then 723 = 1 , which divides 45. But for any k > 1 , 723 does not divide 
45. Hence, G contains a unique Sylow 3-subgroup H of order 9. Consequently, 
G has a normal subgroup of order 9 . 


0 Exercise 2 Let G be a finite group of order p m g, where p and q are relatively 
prime, and P be a subgroup of order p m , where p is a prime. Show that 
P is the only Sylow p-subgroup of order p m lying in N(P). 


Solution: Clearly |1V(P)| = p m r for some r < q and p and r are relatively 
prime. Let P' be any other Sylow p-subgroup of G such that P' C N(P). Then 
P and P' are Sylow p-subgroups of N(P). Thus, there exists x £ N(P) such 
that P' = xPx~ l . Since P is normal in N(P ), P = xPx~ l . Hence, P' = P. 
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0 Exercise 3 Let G be a finite group and p a prime such that p divides |G| . 

(i) Let K be a normal subgroup of G. Show that for any Sylow p-subgroup 
P of G, PC\K is a Sylow p-subgroup of K. Conversely, if B is any Sylow 
p-subgroup of K , show that there exists a Sylow p-subgroup P of G such 
that B — P n K. 

(ii) Let H be a normal subgroup of G. If P is a Sylow p-subgroup of G, 
show that PH/H is a Sylow p-subgroup of G/H. Conversely, show that 
any Sylow p-subgroup of G/H is of the form PH/H, where P is a Sylow 
p-subgroup of G. 


Solution: (i) Let |G| = p m g, where p and q are relatively prime. Let P be a 
Sylow p-subgroup of G. Then \P\ = p m . Since \P fl K\ divides \P\ ,\P P\ K\ = p l 
for some i < m. Hence, P D K is a p-group. Let \K\ = p s t , where p and t are 
relatively prime and s >' i. Suppose s > i. Now |jPjF£T| = = p ? 1 = 

p^ppt = p m+ G j = s — i > 1, which is impossible since] G\ = p Tn q and PK is 
a subgroup of G. Thus, s = i. Hence, \P f) K\ = p s , i.e., P fl K is a Sylow 
p-subgroup of K. Conversely, let B be a Sylow p-subgroup of K. Let \K\ — p s t , 
where p and t are relatively prime. Then \B\ = p s . Now P fl K is a Sylow 
p-subgroup of K for any Sylow p-subgroup P of G. Then there exists a £ K 
such that B = a~ l (P D K)a = a~ l Pa n a~ l Ka = Q Pl K, where Q = a~ 1 Pa. 
Clearly Q is a Sylow p-subgroup of G. 

(ii) Let |G| = p m <p where p and q are relatively prime. Let P be a Sylow 
p-subgroup of G. Then |P| = p m . Let \H\ — p s t, where p and t are relatively 
prime. Now P fl H is a Sylow p-subgroup of H. Hence, |P n H\ = p s . Now 

— p m ~ s . Also, 


\PH/H\ = IHfl = = _1£L _ e: = 


\G/H\ = || = 


p s t 


1 = P 


\H\ ~ \H\\PnH\ ~ \PDH\ p> 

m_s r. Hence, PH/H is a Sylow p-subgroup of G/H. Conversely, let 


B/H be a Sylow p-subgroup of G/H. Now PH/H is a Sylow p-subgroup of 
G/H for any Sylow p-subgroup P of G. Therefore, there exists aH £ G/H 
such that B/H = a~ 1 H{PH/H)aH. Now for all b £ PH , a~ l HbHaH £ B/H , 


and hence for all b £ PH, a l ba £ B. Thus, a l {PH)a C B. Let Q = a l Pa. 
Then Q is a Sylow p-subgroup of G. Also, cT l Ha = H since H is normal. Now 
QH = (a- l Pa){a~ l Ha) = a~ l (PH)a C B. Let c £ B. Then cH £ B/H = 


a~ l H(PH/H)aH. Therefore, cH - a^HbHaH = a~ l baH for some b £ PH. 


,-L 


Let b = uv for some u £ P,v £ H. Then a -1 6a = a~ l uva = (a -1 ua)(a "va 
£ ( a~ 1 Pa)H = QH. Now cH — a~ l baH implies c~ l {a~ l ba) £ H C QH. 
Hence, c~ l £ QH or c £ QH. Thus, B = QH. 


{> Exercise 4 Let H be a normal subgroup of a finite group G and p be a 
prime dividing the order of G. If [G : H] and p are relatively prime, prove 
that H contains all Sylow p-subgroups of G. Show by an example that 
the result need not be true if H is not normal in G. 
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Solution: Let |G| = p k m , where p and m are relatively prime. Let 
\G/H\ = [G : H] = q. Then it follows that q\m. Thus, p k divides \H\ since 
|G| = q \H |. Hence, \H\ = p k r , where p and r are relatively prime. Let P be 
a Sylow p-subgroup of H. Then |P| = p k . Hence, P is a Sylow p-subgroup of 
G. If Q is any other Sylow p-subgroup of G, then there exists x E G such that 
Q ~ x~ l Px. Hence, Q = x~ l Px C x~ l Hx — H. 

Consider G = S 3 and let H = {e, (1 2)}. Then H is a subgroup of G, which 
is not normal. Now [G : H] = 3, p = 2 divides |G| . But H does not contain all 
Sylow 2 -subgroups of G. The Sylow 2-subgroups of G are {e, (1 2 )}, {e, (1 3)}, 
and {e, (2 3)}. 

0 Exercise 5 Show that a group of order 96 has a normal subgroup of order 
16 or 32. 

Solution: Let G be a group of order 96 = 2 5 -3. Let n 2 denote the number of 
Sylow 2-subgroups of G. Now 722 = 2&+1 for some integer k > 0 and n 2 divides 
96. Then 712 = 1 or 3. If 722 = 1, then G contains a unique Sylow 2-subgroup of 
order 32. This subgroup of order 32 must be a normal subgroup by Theorem 
7.3.7. Suppose 722 = 3. Then G has three Sylow 2-subgroups A, B, and G, 
each of order 32. Let us now show that \A fl B\ — 16. Since B and |A n B\ 
divides |A|, \A fl B\ = 1, 2, 4, 8, or 16. If \A fl B\ < 8, then \AB\ = shows 

that \AB\ > = 128 > 96 = |Gf, a contradiction. Hence, \AC\B\ = 16. 

Since [A : A D B] — 2 and [B : A fl B] = 2, A D B is a normal subgroup of 
A and B. Thus, A,B C N(A fl B ). Therefore, AB C N(A Pi B). This implies 
that | N(A fl B) | > \AB\ = = 64. Since N(A fl B) is a subgroup 

of G, it follows that |N(H fl B)\ = 96. Thus, N(A fl B) = G and so A fl B is a 
normal subgroup of G of order 16. 

0 Exercise 6 If a group G of order 52 contains a normal subgroup of order 
4, show that G is a commutative group. 

Solution: Suppose G contains a normal subgroup H of order 4. Then H is 
a commutative group. Now |G| = 13-4. Let 72 x 3 denote the number of Sylow 13- 
subgroups of G. Then 7213 = 13 k+ 1 for some integer k > 0 and 7213 divides 52. 

Thus, 72x3 = 1 and so G contains a unique Sylow 13-subgroup, say, A. Then A 
is a normal subgroup of order 13 and AC\H = {e}. Since \AH\ — = 52, we 

find that G = Ax H. Since A and H are both commutative, G is commutative. 

Exercise 7 Let G be a finite group. Suppose that every Sylow subgroup of G 
is normal in G. Prove that G is the internal direct product of its Sylow 
subgroups. 

Solution: Let |G| = p^p 7 ^ 2 where pi are distinct primes. Since 

every Sylow p-subgroup of G is normal, there exists a unique Sylow p-subgroup 
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for p = pi ( i = 1,2,... ,k). Let S(pi) be the Sylow ^-subgroup of G for all 

i. Then S(pi ) is a normal subgroup of G and S(p l ) D S(p 3 ) = {e} for all 
i 7 ^ j. Hence, a^j = ajCii for all a* € S(pi ) and a 3 € S(pj). Now consider, 
S(pi) n (S(pi ) • • • S(pi-i)S(pi+i) • ■ • S(p k )). Suppose 

a G Sfa) n (5(pi) • • • S(pi-i)S{p i+ i) ■ ■ • S{p k )). 

Then a G 5(p*) and a G S'(pi) • ■ • S(pi-i)S(pi + i) ■ ■ ■ S(pk). Hence, 

a -— uj • • • ai —• ■ ■ cik , 


where a 3 G S(pj). Now 


°(a)\p n l 'p?---p’lL-iPZV---P n k t 

and o(a)|p”b Consequently, o(a) — 1, i.e., a — e. Thus, 

S(pi) n • • • S(pi-i)S(p i+ i) • • • S{p k )) = {e}. 
This implies that IS'(pi) • • • S(pk)\ = p^p 7 ^ 1 * • -p£ fc = |G| and hence 


G = S( Pl )---S( Pk ). 

Thus, G = 5(pi) x S(p 2 ) x • • ■ x S(pk). 

7.3.2 Exercises 

1. Find the Sylow 3-subgroups of 54 . 

2. Prove that if G is a group of order p n , p a prime, then G contains a 
normal subgroup of order p l for every nonnegative integer t < n. 

3. Prove that a group G has only one proper subgroup if and only if G is a 
cyclic group of order p 2 for some prime p. 

4. Prove that for any group G. \G/Z(G) \ 7 ^ 91. 

5. Let G be a finite group and P be a Sylow p-subgroup of G. Let H be a 
subgroup of G such that Nq(P) C H. Prove that Nq(H ) = H. 

6 . Let G be a finite group, P and H be subgroups of G such that P is a 
normal subgroup of H, and H is a normal subgroup of G. Show that if 
P is a Sylow p-subgroup of G , then P is a normal subgroup of G. 

7. Let G be a group of order 143. Show that Sylow 11 -subgroup of G is 
unique. Also, show that G is cyclic. 
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8 . Let G be a finite group, H be a normal subgroup of G , and P be a Sylow 
p-subgroup of H. Show that G = HNg{P)- 

9. Let G be a finite commutative group. Show that G is the internal direct 
product of its Sylow subgroups. 

10. Let G be a finite group and K be a normal subgroup of G. If K is a 
p-subgroup, prove that K is contained in every Sylow p-subgroup of G. 

11 . Let G be a finite group and suppose |G| = p k n , where p is a prime and p 
and n are relatively prime. Prove that every p-subgroup of G is contained 
in some Sylow p-subgroup of G. 

12. Let G be a group such that \G\ = p m , where p is a prime. Let H be a 
proper subgroup of G. Prove that there exists a € G, a £ H such that 
aHa - 1 = H. 

7.4 Some Applications of the Sylow Theorems 

We recall that a group G ^ {e} is called simple if it has no nontrivial normal 
subgroups. If G is commutative, then it follows from Lagrange’s theorem that 
G is simple if and only if G is of prime order. In Galois’s mathematical legacy 
to us, he wrote in a letter to a friend on the eve of his death stating that the 
alternating group A 5 is the smallest noncommutative simple group. William 
Burnside conjectured in 1911 that no noncommutative simple group of odd 
order exists. The mathematicians John Thompson and Walter Feit proved in 
1963 that Burnside’s conjecture was true. John Thompson received the Fields 
Medal for his work on this and other problems. 

In this section, we apply the Sylow theorems to determine some finite groups 
which are not simple. 

Example 7.4.1 Let G be a group of order 10. Now 10 = 5 • 2. Let n 5 denote 
the number of Sylow 5-subgroups of G. From, Sylow Theorem, 7.3.12, n§ — 5k + l 
for som,e integer k > 0 and n 5 divides |G| = 10. Thus, n$ = 1 and so there 
exists only one Sylow 5-subgroup, say, H in G. Since H is a unique Sylow 
5-subgroup, H is a normal subgroup of G by Corollary 7.3.11, proving that G 
is not simple. Thus, no group of order 10 is simple. 

Example 7.4.2 Let G be a, group of order 9. Then G is a, p-group, where 
p — 3. From, Theorem, 7.2.7, we fi,nd that Z(G) ^ {e}. If G — Z(G), then G is 
a commutative group. But commutative simple groups are precisely groups of 
prim,e order. Hence, in this case G is not simple. Suppose Z(G) 7 ^ G. Then 
Z(G) is a nontrivial norm.al subgroup of G. Thus, we find that a group of order 
9 is not a, simple group. 
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In Example 7.4.2, we showed that a group of order 9 = 3 2 is not simple. 
In the next theorem, we prove that, in general, if G is a p-group of order p n , 
n > 1, then G is not simple. 

Theorem 7.4.3 Let p be a prime integer and n > 1 be any integer. Then no 
group of order p n is simple. 


Proof. Let G be a group of order p n . Consider the center Z(G ) of G. 
From Theorem 7.2.7, it follows that Z{G) ^ {e}. If G — Z(G ), then G is a 
commutative group. If G is simple, then \G\ is prime, which is a contradiction. 
Thus, in this case G is not simple. Suppose Z(G) ^ G. Then Z(G) is a 
nontrivial normal subgroup of G, proving that G is not a simple group. ■ 

Theorem 7.4.4 Let p and q be two prime integers. Then no group of order 
pq is simple. 


Proof. Let G be a group of order pq. If p = q, then |(C| = p 2 and so by 
Theorem 7.4.3, G is not simple. Suppose now p ^ q. Let p > q. Let n p denote 
the number of Sylow p-subgroups of G. Then n p = pk + 1 for some integer 
k > 0 and n p divides pq. Since gcd(l + kp , p) — 1, n p does not divide p. Hence, 
n p divides q. Thus, 1 + kp < q. But p > q. Therefore, 1 4- kp < q holds only if 
k = 0. This implies that n p = 1 and so G contains a unique Sylow p-subgroup 
of order p, which must be normal by Corollary 7.3.11. Hence, G is not simple. ■ 


At this point let us recall the following result established in Worked-Out 
Exercise 5 (page 177). 

In a group G of order pn, where p is a prime and p > n, if H is a subgroup 
of order p, then H is a normal subgroup. Now from Cauchy’s theorem, any 
group of order pn, p prime, contains a subgroup of order p. Consequently, G 
contains a normal subgroup of order p. 

Let G be a group of order n < 60. Applying the above result, we find that 
if n= 6 (= 3-2), 10 (=5-2), 14 (=7-2), 15 (-5-3), 20 (-5-4), 21 (=7-3), 
22 (= 11-2), 26 (= 13 ■ 2), 28 (= 7 • 4), 33 (= 11 • 3), 34 (= 17 • 2), 35 (= 7 • 5), 
38 (= 19-2), 39 (= 13-3), 42 (= 7-6), 44 (= 11-4),46 (= 23-2), 51 (= 17*3), 
52 (= 13-4), 55 (= 11-5), 57 (= 19-3), or 58 (= 29-2), then G is not simple. 

In Worked-Out Exercise 7 (page 178), we have established that any group 
of order 2n, where n is an odd integer, contains a normal subgroup of order n. 
Using this result, we find that no groups of order 6 (= 2 - 3), 18 (= 2 ■ 9), 50 
(= 2 ■ 25), 54 (= 2 • 27), are simple. 

Next, let us recall the following result established in Corollary 5.4.10. Let 
G be a finite group and H a proper subgroup of G of index n such that |G| 
does not divide n! Then G contains a nontrivial normal subgroup. 
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Now suppose G is a group of order 12 = 2 2 • 3. From Theorem 7.3.6, we 
find that G contains a Sylow 2 -subgroup H of order 4. Thus, the index of H is 
3. Now |G| = 12 does not divide 3! Therefore, G contains a nontrivial normal 
subgroup, proving that G is not simple. Proceeding this way with the help of 
the above result, we can show that no group of order 24 (= 2 3 * 3), 36 (= 3 2 *4), 
45 (= 3 2 • 5), or 48 (= 2 4 ■ 3) is a simple group. 

Example 7.4.5 In this example, we show that no group of order 40 is simple. 
Let G be a group of order 40 = 5 • 8 . Let n$ denote the number of Sylow 5- 
subgroups of G. By Sylow Theorem 7.3.12, ns = 5k + 1 for some integer k > 0 
and divides 40. Hence, n 5 — 1 . Thus, G has a unique Sylow 5-subgroup 
which must be normal by Corollary 7.3.11. Hence, G is not simple. 

Example 7.4.6 In this example, we show that no group of order 56 is simple. 
Let G be a group of order 56 = 7 • 2 3 . Let nj denote the number of Sylow 
7-subgroups a,nd n 2 denote the number of Sylow 2 -subgroups of G. By Sylow’s 
third theorem (Theorem, 7.3.12), nj = 7m + 1 and n 2 — 2k + 1 for som,e 
integers m, k > 0. Now n-j divides 56. Thus, nj = 1 or 8. If nj = 1 , then G 
has a unique Sylow 7-subgroup which m,ust be normal. Hence, G is not simple. 
Suppose n 7 = 8 . Then G has eight Sylow 7-subgroups A\, A 2 ,... ,Ag. Now 
| A{\ — 7, i — 1 , 2,..., 8 . Also, AiC\Aj — {e} fori ^ j and for all a 7 ^ e, a 6 Ai, 
o (a) = 7. Thus, G contains 48 elements of order 7. Now — 1 or 7. If n 2 = 1, 
then G has a unique Sylow 2 -subgroup which must be normal. Hence, G is not 
simple. Suppose n 2 = 7. Then G has seven Sylow 2 -subgroups B\,B 2 , ■ ■ ■ , B 7 . 
Each Bi contains eight elements. Since B\ B 2 , \Bi D # 2 ! <4. This implies 
that B\ U B 2 contains at least 12 elements, none of which is of order 7. Hence, 
|G| > 48 + 12 = 60, a contradiction. Thus, we find that either n 7 = 1 orn 2 = 1, 
showing that G has either a norm.al subgroup of order 7 or a norm.al subgroup 
of order 8. Consequently, G is not simple. 

In Worked-Out Exercise 1 (page 216), we show that a group of order 30 
is not simple. By Theorem 7.4.3, no group of order 4 = 2 2 , 8 = 2 3 , 9 = 3 2 , 
16 = 2 4 , 25 = 5 2 , 27 = 3 3 , 32 = 2 5 , or 49 = 7 2 is simple. We now summarize 
the above results. 

Theorem 7.4.7 Let n be an integer such that 1 < n < 60 and n is not prim.e. 
Then no group of order n is simple. ■ 

Let us now concentrate our discussion on n = 60. Since 60 is not prime, no 
commutative group of order 60 is simple. Now what is the answer if G is a 
noncommutative group of order 60? Recall that A$ is a simple group of order 
60. Hence, we find that there exists a noncommutative simple group of order 
60. Next, let us ask the following question. Is A$ the only (up to isomorphism) 
noncommutative simple group of order 60? To answer this question, we first 
prove the following result. 
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Lemma 7.4.8 Let G be a simple group of order 60. Then G contains a sub¬ 
group of order 12. 

Proof. Suppose G has no subgroup of order 12. Now |(7| = 60 — 5 • 3 • 2 2 . 
Let ns denote the number of Sylow 5-subgroups and 77,2 denote the number 
of Sylow 2-subgroups of G. By Sylow Theorem 7.3.12, 77,5 — 5 m + 1 for some 
integer m > 0 and 715 divides 60. Thus, 715 = 1 or 6 . Since G is simple, 
77.5 7 ^ 1. Hence, 775 = 6. Then G has six Sylow 5-subgroups Ai, A 2 , ■ -., Aq. Now 
\Ai\ = 5, i = 1, 2 ,..., 6 . Also, Ai fl Aj = {e} for i ^ j and for all e 7 ^ a 6 A*, 
o(a) = 5. Thus, G contains 24 elements of order 5. Now 77,2 = 1, 3, 5, or 15. 
Since G is simple, 77,2 7 ^ 1. Suppose 77,2 = 15. Let Bi , i = 1, 2, ..., 15, be the 
15 Sylow 2-subgroups of G. If Bi fl Bj = {e} for 1 < i 7 ^ j < 15, then U \=\Bi 
contains 46 elements of order not equal to 5. Hence, 60 = |CJ| > 24 + 46 — 70, 
a contradiction. Therefore, there exist i,j such that Bi Pi Bj 7 ^ {e}. Then 
|Bi fl Bj | = 2. This implies that Bi D Bj is a normal subgroup of Bi and Bj. 
Thus, Bi , Bj C N(Bi fl Bj) and so B{Bj C N(Bi fl Bj). Hence, \N(B t n Bj)\ > 
\BiBj\ — 8 . Since N(Bi fl Bj) is a subgroup of G and \N(Bi C\ Bj)\ > 8 , it 
follows that | N(Bi D Bj )| = 12, 20, 30, or 60. Now \N(Bi n Bj )| 7 ^ 30 for then 
N(Bi D Bj) is normal in G. Also, from our assumption, \N(Bi C Bj )| 7 ^ 12. If 
| N(Bi D Bj )| = 20, then from Corollary 5.4.10, G contains a nontrivial normal 
subgroup, which is a contradiction. Hence, \N(Bi fl Bj)\ = 60, proving that 
Bi D Bj is a normal subgroup of G, which is also a contradiction. Suppose 
77.2 = 3 or 5. Let B be a Sylow 2-subgroup of G. Then 1 -I- 2 k = n<i — [G ; 
jV(f?)]. Thus, N{B) 7 ^ B and so |iV(B)| 7 ^ 4. But 4 divides |1V(B)| and |AT(B)j 
divides 60. Hence, |1V(B)| = 12,20, or 60. Proceeding as above, we again get a 
contradiction. Consequently, G must contain a subgroup of order 12. ■ 

Theorem 7.4.9 Any simple group of order 60 is isomorphic to A$. 

Proof. Let G be a simple group of order 60. By Lemma 7.4.8, G contains 
a subgroup H of order 12 . Since [G : H] = 5, it follows that there exists a 
nontrivial homomorphism / : G —» 65 such that Ker / C H by Corollary 5.4.9. 
Since G is simple, Ker f = {e}. Hence, G is isomorphic to a subgroup, say, T, 
of £ 5 . We show that T = A$. This will follow if we can show that T does not 
contain any odd permutation. Suppose T contains an odd permutation. Then 
the set of all even permutations is a normal subgroup of T of index 2. This im¬ 
plies that the group G, which is isomorphic to T, contains a nontrivial normal 
subgroup, a contradiction. Therefore, T C A 5 . But 60 = \G\ = \T\ — \A^\ . 
Consequently, T = A$ and so G ~ A$. ■ 

From Theorem 7.4.9, it follows that A 5 is the smallest noncommutative 
simple group. 
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The complete classification of simple groups was given in 1981. Hundreds of 
mathematicians contributed to this outstanding accomplishment. Two major 
contributors other than Thompson and Feit were M. Aschbacher and R.L. 
Griess. Certain troublesome groups appeared in the classification of simple 
groups. The largest of these sporadic groups was constructed by Griess. This 
group, known as the monster, has order approximately 8 x 10 53 . Other names 
associated with the determination of simple groups are Emil Mathieu (1838— 
1890), F.N. Cole (1861-1927), G.A. Miller, Leonard Eugene Dickson (1874- 
1954), Jean Dieudonne, Claud Chevalley, Richard Brauer, F.A. Fowler, Daniel 
Gorenstein, and John H. Conway. 

Let us now apply the Sylow theorems to classify some groups of small order. 

Example 7.4.10 Let G be a group of order 15 = 5-3. By Sylow’s third theorem, 
(Theorem, 7.3.12), G has a Sylow 5 -subgroup A a,nd a Sylow 3-subgroup B. It 
is easy to check that A is a unique Sylow 5-subgroup and B is a unique Sylow 
3 -subgroup of G. Hence, A is a norm,al subgroup of order 5 and B is a normal 
subgroup of order 3. Now A Fl B = {e}. Thus, \AB\ — = 15. Hence, 

G = AB, A Pi B — {e}, and A a,nd B are norm,al subgroups of G. Thus, 
G = AxB~Z 5 xZ 3 ~ Z 15 since gcd(3, 5) = 1. Hence, G is a, cyclic group. 

In the next theorem, we classify all groups of order pq , where p and q are 
distinct primes. 

Theorem 7.4.11 Let G be a group and p, q be prim,es with p > q. If \G\ — pq, 
then G is either cyclic or generated by two elements a and b satisfying the 
following properties: IP — e, a q = e, and o _ 1 6 a = b T , where p does not divide 
(r — 1), but p\(r q — 1). The second possibility can occur only if q\(p — 1). 


Proof. By Cauchy’s theorem, G contains an element b of order p. Set 
P = (b). Since P is a Sylow p-subgroup of G , it has 1 + mp conjugates for 
some nonnegative integer m„ Now 1 + mp = [G : N(P)\, which divides |G| = pq. 
Since 1 + mp and p are relatively prime, (1 + m,p)\q. However, q < p so that 
77i = 0. Hence, P is a normal subgroup of G. 

Now G contains an element a of order q. Set S = (a) . Then S' is a Sylow 
g-subgroup of G. Hence, [G : iV(S)] = 1 + kq for some nonnegative integer k. 
As above, 1 + kq divides p. Thus, either k = 0 or q\(p — 1). If k = 0, then S is 
a normal subgroup of G so that G ~ P x S. That is, G ~ Z p x Z g ~ Z pg . 

Suppose q\{jp — 1). Then S is not a normal subgroup of G. However, since 
P is a normal subgroup of G , a~ l ba = b r for some integer r. We may assume 
p J((r — 1) else we return to the commutative case. By induction on j, it follows 
that a _J 6a J = b r3 . In particular, if j = q, we have b — b rq so that p\(r q — 1). ■ 


Corollary 7.4.12 Let G be a group of order pq, p and q be prim.es with p > q. 
If q does not divide p — 1, then G is cyclic. ■ 



7.4. SOME APPLICATIONS OF THE SYLOW THEOREMS 


215 


In Chapter 5, we defined and studied D 4 , the dihedral group of degree 4. 
Let us now define the dihedral group D n of degree n > 3. 

Definition 7.4.13 A group G is called a. dihedral group of degree n > 3 if 
G is generated by tuio elements a, b such that 
(i) o(a) = n, b 2 = e, and 
(it) ba = a~ 1 b. 

We denote a dihedral group of degree n > 3 by D n . 

Example 7.4.14 Consider the symmetric group S n (n > 3). The subgroup G 
generated by 


a = (1 2 3 ••• n), 

12 3 i 

1 n n — 1 n + 2 — i 

is an example of a dihedral group of degree n. 

We leave the proof of the following theorem as an exercise. 

Theorem 7.4.15 Let G be a dihedral group of degree n > 3. Then G has 2 n 
elements. ■ 

Theorem 7.4.16 Let G be a group and p be an odd prim.e. If \G\ = 2 p, then 
G is either cyclic or dihedral. 




Proof. By Cauchy’s theorem, G contains an element a of order p and an 
element b of order 2. Let H = (a) . Then H is a normal subgroup of G since 
[G : H] = 2. Now bob — bab~ l E H. Hence, there exists a 1 E H such that 
bab — a l , where 0 < i < p. Now a? = (a 2 ) z = ( bab) l = (bab ^ 1 ) 1 = balb. 
A|ain from bab = a l , we find that a = ba l b. Hence, a = a 1 . This implies that 
a 1 — e. Since o(a) = p, it follows that p\(i 2 — 1 ). Therefore, p\(i — 1) or 
p\(i + 1) since p is prime. Suppose p\(i — 1). Then i — 1 = 0, i.e., i — 1. Thus, 
bab = a, which implies ba = ab. So in this case, we find that G contains an 
element of order 2 p and so G is a cyclic group. If p\(i + 1 ), then bab = a~ l . 
Hence, G is generated by a.b such that o(a) = p, 0 ( 6 ) = 2, and ba = a~ l b. In 
this case, G is the dihedral group D p . ■ 


Let us now classify groups of order n < 10. 

Let G be a group of order n < 10 . If n = 1 , then G = {e} and thus is 
cyclic. If n = 2,3, 5, or 7, then G is of prime order and hence cyclic. For n = 4, 
we know* that G is isomorphic to either Z 4 or Z 2 x Z 2 . If n = 6 , then G is 
isomorphic to either Zq or S 3 ~ D 3 . For n — 8 , if G is noncommutative, then 
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G is isomorphic to either D 4 or Qg.If G is commutative, then in Chapter 9 we 
will show that G is isomorphic to either Zs, Z 4 x Z 2 , or Z 2 x Z 2 x Z 2 . 

Let us now consider the case n = 9. Then G has order 3 2 . Since 3 is prime, 
G is commutative. Let e ^ a 6 G. Then o(a) = 3 or 9. If o(a) = 9, then 
G ~ Z 9 . Suppose G has no elements of order 9. Then o(a) = 3. Let H = {e, a, 
a 2 }. Then H is a subgroup of G and \H\ = 3. Let 6 € G be such that b £ H. Let 
K = {e, 6, fe 2 }. Now H and K are normal subgroups of G, H fl K — {e}, and 
G = HK. Hence, G = H x K c=: Z 3 X Z 3 . Thus, either G ~ Z 9 or G ~ Z 3 x Z 3 . 

Suppose now n = 10. Then from Theorem 7.4.16, it follows that either 
G ~ Z 10 or G ~ £> 5 . Hence, there are (up to isomorphism) two distinct groups 
of order 10 . 

We summarize the above discussion in the following table: 


Order of the group Number of Groups 


1 1 

2 1 

3 1 

4 2 

5 1 

6 2 

7 1 

8 5 

9 2 

10 2 


Groups 
{e} = Z 0 

Z 2 

Z 3 

Z 4 , Z 2 x Z 2 

Z 5 

Z6,^3 

Z 7 

Zs, Z 4 x Z 2 , Z 2 x Z 2 x Z 2 , D 4 , Qs 
Z 9 , Z 3 x Z 3 

Z 10 , 


In the Worked-Out Exercises below, we illustrate several techniques that 
can be effectively used to find the Sylow subgroups of a group. 


7.4.1 Worked-Out Exercises 

<v> Exercise 1 Let G be a group of order 30. Show that G is not simple. 


Solution: Since |G| = 30 = 2 • 3 • 5, G has a Sylow 2-subgroup, a Sylow 3- 
subgroup, and a Sylow 5-subgroup. Consider Sylow 5-subgroups. The number 
of Sylow 5-subgroups is 1 + 5fc, where 1 + 5fc|6. Thus, k = 0 or 1. If k = 0, then 
G has only one Sylow 5-subgroup, and hence this unique Sylow 5-subgroup 
must be normal in G. Therefore, in this case, G is not simple. Suppose k = 1 . 
Then G has six distinct Sylow 5-subgroups, say, H 1 , H 2 , ■ ■ ■, H§. Now for i / j, 
\HiC\Hj \ = 1 since H t C\H 3 is a subgroup of Hi. Thus, the six Sylow 5-subgroups 
contain 24 distinct elements of order 5. Now consider Sylow 3-subgroups. The 
number of Sylow 3-subgroups is 1 + 3Aq, where 1 + 3Aq|10. Thus, = 0 or 3. If 
k\ = 0, then G has a unique Sylow 3-subgroup, which must be normal in (7, and 
hence, in this case, G is not simple. Suppose k\ = 3. Then G has 10 distinct 
Sylow 3-subgroups. As in the case of Sylow 5-subgroups, we conclude that if 
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k i=3, then G has 20 distinct elements of order 3. Thus, |C7| > 24 + 20 = 44, 
a contradiction since G has only 30 elements. Hence, if k = 1, then k\ — 0. 
Thus, G either has a Sylow 5-subgroup normal in G or a Sylow 3-subgroup 
normal in G. 


0 Exercise 2 Let G be a group of order 36. Prove that G is not simple. 

Solution: (We have already established that a group of order 36 is not 
simple on page 212 . Our objective here is to show some different techniques that 
can be used in other cases.) Since |(7] = 36 = 2 2 ■ 3 2 , G has a Sylow 3-subgroup 
of order 9. The number of Sylow 3-subgroups is 1 + 3 k, where (1 + 3fc)|4. Thus, 
A: = 0 or 1 . If /c = 0, then G has only one Sylow 3-subgroup which must be 
normal in G. Suppose k = 1 . Then G has four distinct Sylow 3-subgroups, 
say, Hi, if 2 , # 3 , # 4 . Consider Hi and H 2 . Now H 1 D H 2 is a subgroup of H\ 
(and also of H 2 ). Since \H\\ = 9 and the order of H\ n H 2 divides the order 
of H\, |iii nif 2 | = 1,3, or 9. If \H\ Pi i/ 2 | = 9, then Hi = H 2 , which is a 
contradiction. Suppose \H\ fl H 2 \ = 1. Then \HiH 2 \ = = ^ = 81, i.e., 

H 1 H 2 has 81 elements, which is a contradiction since G has only 36 elements. 
Hence, \H± fl jH 2 | = 3. By Worked-Out Exercise 5 (page 199), Hi fl i / 2 is a 
normal subgroup of Hi and HT Therefore, H\, H 2 C N(H\ fl -H 2 )- As before, 
H 1 H 2 has 27 elements since H\ Pi H 2 has three elements. Thus, N(H\ fl 7f 2 ) 
has at least 27 elements. Since N(Hi fl if 2 ) is a subgroup of G, the order of 
N(Hi Pi H 2 ) divides the order of G. Therefore, \N{H\ Dif 2 )| = 36 and so 
N(H\ n H 2 ) = G. Hence, Hi fl H 2 is a normal subgroup of G and so G is not 
simple. 


0 Exercise 3 Let G be a group of order 231 = 3-7-11. 

(i) Show that a Sylow 11-subgroup of G is normal in G. 

(ii) Show that a Sylow 7-subgroup of G is normal in G. 

(iii) Show that G has a cyclic subgroup of order 77. 

(iv) Let H be a Sylow 11-subgroup of G, K be a Sylow 7-subgroup of G, 
and L be a Sylow 3-subgroup of G. Show that G = HKL. 

(v) Show that H C Z{G). 

Solution: By Theorem 7.3.5, G has a Sylow 11-subgroup, a Sylow 7- 
subgroup, and a Sylow 3-subgroup. 

(i) The number of Sylow 11-subgroups is 1 + 11/c, where (1 + ll/c)|3 ■ 7. 
Hence, k = 0 and so the number of Sylow 11-subgroups is 1. Let H be the 
Sylow 11-subgroup of G. Since H is a unique Sylow 11-subgroup of G, H is 
normal in G. 
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(ii) The number of Sylow 7-subgroups is 1 + 7 k, where (1 + 7/c) [3-11. Hence, 
k = 0 and so the number of Sylow 7-subgroups is 1. Let K be the Sylow 7- 
subgroup of G. Since K is a unique Sylow 7-subgroup of G , K is normal in 
G. 

(iii) Since H and K are normal subgroups of G, HK is a normal subgroup 
of G. Now H n K — {e}. Thus, \HK\ — 7 • 11 = 77. Since H and K are 
subgroups of order 11 and 7, respectively, H and K are cyclic groups. Note 
that gcd(7,11) = 1. Hence, HK is a cyclic group of order 77. 

(iv) Let L be a Sylow 3-subgroup of G. Then L D ( HK ) — {e} since non¬ 
identity elements in L are of order 3 and nonidentity elements in HK are of 
order 7,11, or 77. Now 


HKL\ = 


\HK\-\L\ 
\l n (HK) 


TL-A = 231 = |£?|. 


Hence, G = HKL. 

(v) Since H and K are normal subgroups of G and H D K = {e}, hk = kh 
for all h E H, k E K. Now \G/K\ =3-11. Thus, G/K is a cyclic group and 
hence G/K is commutative. Let a £ L and b E H be nonidentity elements. 
Then a,b £ K. Since G/K is commutative, ( aK){bK ) = (bK)(aK) or ( ab)K = 
( ba)K. Hence, ( ab)~ l {ba ) E K and so b~ l a~ l ba E K. Since H is a normal 
subgroup of G and b E 77, 6 -1 a _1 6a E H. This implies that b~ l a~ 1 ba E H n 
K = {e}. Hence, 6 -1 a -1 6a = e and so ba — ab. Let x E G and h E H. 
Now G = HKL and so x = abc for some a E H, b E K, and c E L. Now 
xh = ( abc)h = ab(ch) = ab(hc) = a{bh)c = a(hb)c = ( ah)bc = ( ha)bc = hx. 
Therefore, h E Z(G). Hence, H C Z{G). 


0 Exercise 4 Let G be a group of order 255. Show that G is cyclic. 


Solution: Now [G| = 255 = 3-5-17. Let H be a Sylow 17-subgroup of G. 
The number of Sylow 17-subgroups is l + 17m, where l + 17m|15. Hence, m = 0 
and so G has a unique Sylow 17-subgroup. Thus, H is a normal subgroup of 
G. Let if be a Sylow 5-subgroup of G and L be a Sylow 3-subgroup of G. The 
number of Sylow 5-subgroups is 1 + 5 k, where 1 + 5/c[51. Hence, k — 0 or 10. 
The number of Sylow 3-subgroups is 1 + 3/, where 1 + 3/|85. Therefore, l = 0 
or 28. Suppose k = 10 and l = 28. Then G has 51 Sylow 5-subgroups and 85 
Sylow 3-subgroups. Hence, in this case G would have 51 • 4 = 204 elements of 
order 5 and 85 ■ 2 = 170 elements of order 3. This is absurd since G has only 
255 elements. Thus, either k = 0 or l = 0. 

Case 1. k = 0. Then K is the unique Sylow 5-subgroup of G and so K is 
normal in G. Now H D K = {e}. Hence, xy = yx for all x E H and y E K. 
Now \G/K\ = 3 • 17. Since 3 does not divide (17 — 1), G/K is cyclic and hence 
commutative. Let a E H and b E L. Since G/K is commutative, aba~ l b~ l E K. 
Since H is normal and a E H, aba~ 1 b~ 1 E H. Hence, aba~ l b~ l E HDK = {e}. 
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Thus, ab — ba for all a £ H and b £ L. Clearly G = HKL. Since H is cyclic, 
H is commutative. Therefore, we have (i) H is commutative, (ii) xy = yx for 
all x £ H and y £ K, and (iii) ab = ba for all a £ H and b £ L. This implies 
that H C Z{G). Hence, \Z{G)\ = 17, 51, 85, or 255 and so \G/Z(G)\ = 15, 5, 
3, or 1. In either case, G/Z{G) is cyclic and hence G is commutative. Thus, G 
has a unique Sylow 3-subgroup. Since G = HKL and H , K , and L are normal 
subgroups of G, G is a direct product of cyclic groups such that the order of 
any two factors is relatively prime and hence G is cyclic. 

Case 2. / = 0. This case is similar to Case 1. 

<0> Exercise 5 Let G be a group of order 455. Show that G is cyclic. 

Solution: Now |G| = 455 = 5-7-13. Let H be a Sylow 13-subgroup of G. 
The number of Sylow 13-subgroups is 1 + 13 k, where 1 + 13A;|35. Hence, k = 0 
and so G has a unique Sylow 13-subgroup. Thus, H is a normal subgroup of G. 
Hence, N(H) = G. Now |Aut(if)| = 12. Since N(H)/C(H ) ~ to a subgroup 
of Aut(tf), \N(H)/C(H)\ divides 12. Also, \N{H)/C(H)\ divides 455. Hence, 
\N(H)/C(H)\ = 1 and so G = N(H ) = C(H). Thus, H C Z(G). This implies 
that \Z(G)\ — 13, 65, 91, or 455. Hence, \G/Z{G)\ = 35, 7, 5, or 1. In either 
case, G/Z{G) is cyclic and hence G is commutative. It now follows that G has 
a unique Sylow 5-subgroup, say, K, and a unique Sylow 7-subgroup, say, L. 
Clearly G = H x K x L. Since H, K, and L are cyclic groups of prime order 
and their orders are relatively prime to each other, G is cyclic. 

7.4.2 Exercises 

1. Show that every group of order 20,28,36,48, or 56 contains a nontrivial 
normal subgroup. 

2. Show that no group of order 125 is simple. 

3. Show that no group of order 65 is simple. 

4. Show that a group of order 130 contains a nontrivial normal subgroup. 

5. Show that no group of order 75 is simple. 

6. Show that a group of order 96, 150, or 200 is not simple. 

7. Let G be a group of order 35. Show that G is cyclic. 

8. Let G be a group of order 133. Show that G is cyclic. 

9. Let G be a group of order 5-7-19. 

(i) Show that G has a unique subgroup of order 5. 

(ii) Show that G is cyclic. 
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10. Let G be a group of order 100. Suppose that G has a unique Sylow 
2-subgroup. Show that G is commutative. 

11. Let G be a group of order 70. 

(i) Show that G has a unique Sylow 7-subgroup. 

(ii) Show that G has a unique Sylow 5-subgroup. 

(iii) Show that G has a cyclic subgroup of order 35. 

12. Let G be a group of order 385. Show that a Sylow 7-subgroup of G is in 
the center of G. 

13. Let G be a group of order 5 • 11 • 19. Show that a Sylow 19-subgroup of G 
is in the center of G and a Sylow 11-subgroup of G is a normal subgroup 
of G. 

14. Let G be a group of order 3 • 11 ■ 19. Show that a Sylow 11-subgroup of G 
is in the center of G and a Sylow 19-subgroup of G is a normal subgroup 
of G. 

15. Let G be a simple group of order 168. 

(i) Show that G has eight Sylow 7-subgroups. 

(ii) Let H be a Sylow 7-subgroup. Show that jiV(j( J ff)| = 21. 

(iii) Show that G has no subgroup of order 14. 

16. Show that there exists (up to isomorphism) only one group of order 77. 

17. Let G be a group of order 123. Show that for every positive divisor n of 
123, there exists a unique subgroup of order n in G. 

j 

18. Determine up to isomorphism all groups of order 70. 

19. Let G be a group of order p n m, p prime, p > m, n > 1. Show that G is 
not simple. 

20. Let Gbea group of order p 2 g, p and q are distinct primes. Show that G 
is not simple. 

21. Classify all groups of order 14. 

22. Prove that D n is a noncommutative group of order 2 n. 

23. Find Z(D n ). 

24. Find the conjugacy classes in D 2 n and £> 271+1 • 
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25. Let G be a group of order p 2 q 2 , where p and q are prime integers such 
that p > q. Prove the following. 

(i) The number of Sylow p-groups cannot be q. 

(ii) If the number of Sylow p-subgroups is g 2 , then p = 3 and q = 2 . 

26. Show that no group of order p 2 g 2 , where p and q are prime integers, is 
. simple. 

27. Show that Zs, Z 4 x Z 2 , Z 2 x Z 2 x Z 2 , and D 4 are nonisomorphic groups 
of order 8 . Prove that Qg is not isomorphic to the above groups. 

28. Show that Z 12 , Z 2 x Ze, Z 2 x S 3 , and A 4 are nonisomorphic groups. 

29. Write the proof if the statement is true; otherwise, give a counterexample. 

(i) If a prime p divides the order of a group G, then G contains a normal 
subgroup of order p. 

(ii) Let G and H be groups of order 39 and 21, respectively. These two 
groups are not isomorphic, but their Sylow 3-subgroups are isomorphic. 

(iii) There exists only one (up to isomorphism) group of order 65. 

(iv) Every group of order 76 contains a unique element of order 19. 
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Peter Ludvig Mejdell Sylow (1832 
-1918) was born on December 12, 1832, in 
Christiania (now Oslo), Norway. In 1850, 
he graduated from the Christiania Cathe¬ 
dral School. In 1853, he won a mathematics 
prize contest. In 1861, he traveled to Berlin 
and Paris after being awarded a traveling 
grant. He, jointly with Sophus Lie, prepared 
a new edition of Abel’s work from 1873 to 
1881. In 1902, he and Elling Holst published 
Abel’s correspondence. 

Sylow is best known for his work in fi¬ 
nite group theory. In 1845, Cauchy proved 
that every finite group has a subgroup of any 
prime order dividing the order of the group. 
In 1872, Sylow published a ten-page paper extending Cauchy’s result. The theorems 
proved in that paper are known as Sylow’s theorems, which we discussed in Chapter 
7. These theorems are fundamental for structural results in finite group theory. Sylow 
died on September 7, 1918. 
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Chapter 8 

Solvable and Nilpotent 
Groups 

8.1 Solvable Groups 

The purpose of this chapter is to present the Jordan-Holder theorem and the 
notion of solvable groups. The results chosen here lay groundwork for the 
determination of the solvability “by radicals” of a polynomial equation f(x) = 
0. In this regard, we show that the symmetric group S n on n symbols is not 
solvable for n > 5. 

Definition 8.1.1 Let G be a group and 


G = Ho 2 Hi D H 2 D • ■ • 2 H n = {e} 


be a chain of subgroups ofG. The chain is called a subnormal series (chain) 
if each Hi is normal in Hi-\. The chain is called a normal series (chain) if 
each Hi is normal in G. The chain is called a composition series if each Hi 
is a maximal normal subgroup of Hi- 1 , i.e., Hi ^ Hi- i, and if Hi C H C Hi -1 
and H is normal in Hi- 1 , then H = Hi- 1 , i = 1,2 The number of 
proper inclusions D in the chain is called, the length of the chain. The groups 
Hi-!/Hi are called the factors of the chain. 

In Definition 8.1.1, if Hi- 1 = Hi, then the group Hi-\/H l consists of a 
single element and is called a trivial factor of the chain. Hence the length of 
the chain is the number of nontrivial factors H z -\/Hi of the chain. 

Every group G has a normal chain, namely, G D {e}, since {e} is a normal 
subgroup of G. Furthermore, it can be shown by induction on |G| that every 
finite group G has a composition series. The reader is asked to verify this in 
the exercises. 





8.1. SOLVABLE GROUPS 


224 


We see in a composition series G = Hq D Hi D h 2 d 2 H n = {e} 
for a group G that the factors Hi-\fH x are simple groups. In some sense, the 
examination of G has been reduced to its composition factors. 


Example 8.1.2 Consider the symmetric group £4. Set 

( (l 2 3 4 \/l 2 3 4 \/l 2 3 4 \1 
1 j 6 ’\ v 2143y , \ v 3412y , ^4321 y |j 


and 


H 2 



1 2 3 4\1 

2 1 4 3 )J 


Now £4 = Ho D Hi D H 2 D H 2 = {e} is a subnormal chain which is not a 
norm,al chain since H 2 is not norm.al in £4 even though H 2 is normal in H\. 


Example 8.1.3 Consider the group (Zi2,+i2)* Since Z 12 is commutative, all 
subgroups are normal. Hence, the following chains are normal: 


Z 12 3 {[ 6 ]) 3 ([0]), 

Z 12 3 ([3]) 3 {[ 6 ]) 3 {[ 0 ]), 
Z 12 3 ([ 2 ]) 3 {[4]> 3 <[ 0 ]), 
Z 12 3 <[2]) 3 {[ 6 ]> 3 {[0]). 


All chains except Z\ 2 D ([ 6 ]) D ([0]) are composition series. 


Definition 8.1.4 Let G be a group a.nd. 


G = Ho D Hi D H 2 D • • • D D H n = {e} ( 8 . 1 ) 

be a subnormal series in G. A one-step refinement of this series is any 
series of the form 


G = H 0 DH 1 D---D H^ 1 5 H D H x D ■ • ■ D H n _i D H n — {e}, 


where H is a norm,al subgroup of Hi -1 and, Hi is a norm,al subgroup of H , 
i = 1 , 2 ,... ,n. A refinement of (8.1) is a subnormal series which is obtained 
from, ( 8 . 1 ) by a finite sequence of one-step refinements. A refinement 


G = K 0 D K x D K 2 D ■ • • D Km —1 DK m = {e} (8.2) 

of ( 8 . 1 ) is called, a, proper refinement if there exists a subgroup Kj in, ( 8 . 2 ) 
which, is different from, each Hi of ( 8 . 1 ). 
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Thus, a chain of subgroups 


G = K 0 D Kx D K 2 3 - • • 3 Km _! 3 K m = {e} 


of G is called a refinement of a chain of subgroups 


G = Ho 3 Hi D H 2 D ■ • ■ D H n —\ 3 H n = {e} 


of Gif 


{ffo.ffi, #2,. £ {JCo, /fi, /r 2 , - - -, 

and is called a proper refinement if 

{H 0 , H u Ht, C {K 0 ,K 1 ,K 2 ,...,K m }. 


Example 8.1.5 (i) Consider the subnormal series 

Z 36Z 3l2Z 3 48Z 3 {0}. 


(8.3) 


The subnormal series 


Z D2Z D 6Z 312Z D 48Z 3 {0} (8.4) 

is a one-step refi.nem.ent of (8.3). Again the subn.orm.al series 

Z 32Z 3 6Z 312Z 3 24Z 3 48Z 3 {0} (8.5) 

is a one-step refinement of ( 8.4). From the definition, it follows that both (8.4) 
and, (8.5) are proper refinements of (8.3). 

(ii) In Example 8.1.3, Z 12 3 ([3]) 3 ([6]) 3 ([0]) is a refinement of 


Z 12 3 ([61) D <[0]> 


while Z 12 3 ([2]) D ([4]) D ([0]) is not. 


Example 8.1.6 Consider the group (Z ,+). Then Z does not have a compo¬ 
sition series since every subgroup of Z is cyclic and every subgroup ( n) of Z 
contains an infinite chain, namely, 

(n) 3 (2 n) 3 <4n) 3 (8 n) 3 ■ • ■. 


Theorem 8.1.7 A subnormal senes in a group G is a, composition series if 
and, only if it has no proper refinement. 
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Proof. Let 


G = H 0 DH l DH 2 D---D H n ^ D H n = {e} (8.6) 

be a composition series. Let 

G = H 0 D H x D ■ • • D Hi—i D H D H t D ■ • • D D H n — {e} (8.7) 

be a one-step refinement of (8.6). Since (8.6) is a composition series, H z is a 
maximal normal subgroup of Hi- 1 . Thus, either H — Hi- 1 or H = Hi. Hence, 
it follows that (8.6) has no proper refinement. 

Conversely, suppose that 

G = Ho D ffi D H 2 2 • • • 2 Hn- 1 2 Hn = {e} (8.8) 

is a subnormal series, which has no proper refinement. Suppose (8.8) is not a 
composition series. Then there exists a subgroup Hi in (8.8) such that Hi is 
not a maximal normal subgroup in Hi- j. Thus, there exists a subgroup H such 
that Hi- 1 7 ^ H ^ Hi, H is a normal subgroup of Hi- 1 , and if* is a normal 
subgroup of H. This produces a proper refinement of (8.8), a contradiction. 
Hence, (8.8) is a composition series. ■ 

Definition 8.1.8 Two subnormal chains for a group G 

G = H 0 D Hi D H 2 D • • ■ D H n -i ^H n = {e} (8.9) 

G = K 0 D Ki 2 K 2 D ... D 2K m = {e} (8.10) 

are called equivalent if there is a one-one correspondence between the nontriv¬ 
ial factors of (8.9) and. (8.10) such that corresponding factors are isomorphic. 

If the subnormal chains (8.9) and (8.10) are equivalent, then the length of 
(8.9) equals the length of (8.10). 


Example 8.1.9 Consider the subnormal series 

Z D4Z D 12Z D 24Z D 120Z D {0} (8.11) 

Z D2Z D 8 Z D24Z D 120Z D {0}. (8-12) 

The factors of (8.11) are 

Z/4Z ~ Z 4 , 4Z/12Z ~ Z 3 , 12Z/24Z ~ Z 2 , 

24Z/120Z ~ Z 5 , and 120Z/{0}~ Z 

and the factors of (8.12) are 

Z/ 2 Z ~ Z 2 , 2Z/8Z ~ Z 4 , 8Z/24Z ~ Z 3 , 

24Z/120Z ~ Z 5 , and 120Z/{0}~ Z. 

Hence, there exists a one-one correspondence between the factors of (8.11) and 
(8.12). Consequently, (8.11) and ( 8 . 12 ) are equivalent. 
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Theorem 8.1.10 (Zassenhaus Lemma) Let H ', H, if', a,nd K be subgroups 
of a group G such that H' is a normal subgroup of H and K' is a norm,a,l 
subgroup of K. Then H'(H D K ') is a norrrml subgroup of if'(if P if) and 
K f (H' n K ) is a normal subgroup of K'(H P K). Furthermore, 

H'(HDK) _ if'(if ri if) 
if'(if Pif') ~ K'(H’ fl K)' 

Proof. From the hypothesis, it follows that H P K 1 and H' fl K are normal 
subgroups of H n K. Thus, (if fl if') (if' fl K) is a normal subgroup of if fl if. 
Set J = {HO if'Xif'P if). 

Define the function / : if'(if D K) —> (if Pif)/J as follows: If a G if'(ff Pi 
if), then a = h'b , where h! 6 if' and b G if D if. Set f{a) = Jb. Let a\, 
a2 G if'(if D if). Then a\ = h-^b\, 02 = h 2 b2 for some h l ,h 2 G if' and 61,62 6 
if fl if. Suppose 01 = a2- Then h^bi = h! 2 62. Thus, ^ if' fl 

(if n if) C if' D if C J. Hence, J61 = J&2 and so f{a\) = f{a2). Thus, 
/ is well defined. Since if' is a normal subgroup of if, b\h 2 bf l G if'. Now 
aia,2 = h\bih' 2 b2 = hl^ih^bf 1 b\b2 — /?/fri&2, where hi = h^bih^bf 1 G if'. Hence, 
7(0102) = Jb\b2 = Jb\Jb2 — /(o-i)/(a2)- Therefore, / is a homomorphism. 
From the definition of /, it follows that / maps if'(if n if) onto (if Pi if)/ J. 
Also, it is easy to verify that Ker / = if'(if P if'). Hence, by Theorem 5.2.2, 

g'(g n K) (H n if) 

H’(HnK’) J ' 

By symmetry, 

if'(ffPif) (ifPif) 
if'(if'Pif)“ J 

Finally, the desired isomorphism follows from these two isomorphisms. ■ 
Theorem 8.1.11 (Schreier) Any two subnormal series 


G = H 0 DH 1 DH 2 O...D H n —\ D H n = {e } (8.13) 

G = ifo D ifi D if 2 2 • ■ • 3 ifm—l D Km = {e} (8.14) 

of a group G have refinements which are equivalent. 


Proof. Between each Hi and H 1+ \. insert the group if^+i {H l P Kj). j = 
0.1.2, ..., m. From the normality assertions of the Zassenhaus lemma, this 
refinement of (8.13) is a subnormal chain with m,n (not necessarily strict) in¬ 
clusions. Between each Kj and if/+i insert the group Kj + \(Kj P H t ), i = 0, 
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1,2,... ,n. This refinement of (8.14) is also a subnormal chain with ran inclu¬ 
sions. The final refinements are 


• • • 2 H i+1 (Hi n Kj) D H i+1 (H Z fl K j+1 ) D • • • 


and 

• ■ * 2 Kj+i{Kj H Hi) D Kj+i(Kj H Hi + i) D • • •. 

From the Zassenhaus lemma, 


H i+ i(Hi n Kj)/H i+l {Hi n K j+1 ) ~ K j+l (Kj n H i )/K j+l {K j n fii+i). 

Hence, we have the desired result. ■ 

Theorem 8.1.12 (Jordan-Holder) Any two com,position series of a group 
are equivalent. 

Proof. Since composition series are subnormal series, any two composition 
series of G have equivalent refinements. Now a composition series has no proper 
refinements. Thus, a composition series is equivalent to every refinement of it¬ 
self. Hence, any two composition series of a group are equivalent. ■ 

By the Jordan-Holder theorem, we find that if a group G has a composition 
series of length n, then the length of any composition series of G must be n. 
This n is called the composition length of the group G. 

We now show that the fundamental theorem of arithmetic can be estab¬ 
lished from the Jordan-Holder theorem. Let n be a positive integer greater than 
1 and consider the group (Z n ,+ n ). Since Z n is finite, Z n has a composition 
series. Let 


Z n = H 0 DH 1 DH 2 D.-.D H k _ x 2 H k = {[0]} 

be a composition series. The factors H z _i/H z are simple Abelian groups. Hence 
each factor is of prime order. Let \Hi-i/Hi \ — pi. Now 


n = |Z n | = |£f 0 /#i| • \Hi/H 2 \• ■ • \H k - X /H k \ = p lP 2 ---p k . 


This proves that every integer n > 1 can be expressed as a product of prime 
integers. The uniqueness of this factorization follows from the equivalence of 
the composition series. 

Example 8.1.13 Consider the group (Z 3 q,+ 3 o)- Then Z 30 has the following 
two composition series. 


Z 30 D ([5]> Z> <[10]) D {[0]) 
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Z 30 D ([2]) D {[6]) D ([0]) . 

Now 

Z 30 /{[5]) ^Z 3 o/<[2]>, 
but we have the following isomorphisms: 

Z 30 / ([5]> =! <[6]) / ([0]> 

Z 30 / ([2]> ~ ((5]) / {[10]) 

([ 2 ]>/{[ 6 ])~([ 10 ]>/([ 0 ]). 

Definition 8.1.14 A group G is called solvable if it has a subnormal series 


G = H 0 DH 1 DH 2 2---^ H n -1 D H n = {e} 

such that Hi/Hi+i is commutative, i = 0,1,..., n — 1. Such a subnormal series 
is called a solvable series for G. 


Every commutative group is solvable since G = Ho D H\ — {e} satisfies 
the above definition. 


Example 8.1.15 Consider the symmetric group S 3 . Then 

*4b *?)■(; ”)}=■<■> 

is a solvable series for S 3 . Hence, S 3 is solvable. 

Example 8.1.16 Consider the symmetric group S 4 . Then 

S 4 D A 4 D < e, 

l V * \ i± / ° J v 0 * 1 

I / 1 9 3 A \ I 

D {e} 

is a solvable series for S 4 . Thus, S 4 and A 4 are solvable 


1 2 3 4 \ / 1 2 3 4 
2143^^3412 

12 3 4 
2 14 3 


1 2 
4 3 


3 

2 


4 

1 


D 


Since the symmetric groups Si and S 2 are commutative, they are solvable. 
Thus, S n is solvable for n < 4. In Theorem 8.1.27 below, we show that S n is 
not solvable for n > 4. The order of the alternating group A 3 is 3. Hence, A 3 
is commutative and thus solvable. By Example 8.1.16, A 4 is solvable. Thus, 
A n is solvable for n <4. 

In the next few theorems, we show how the solvability of a group is associ¬ 
ated with the solvability of a normal subgroup and the quotient group created 
by the normal subgroup. 


Theorem 8.1.17 If G is a solvable group, then every subgroup of G is solvable 
and every hom.om.orphic image of G is solvable. 
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Proof. Let 


G = Ho 2 Hi 2 H 2 T ■ ■ • 2 H n —i LH n = {e} 

be a solvable series of G. Let K be any subgroup of G. Set Ki — K f\ Hi, 
i — 0,1, ..., n. We shall show that the chain 


K = K 0 2 Ki D K 2 D • • • 2 Kn -1 ^K n = {e} 

is a solvable series for K. It follows that H{+\ C\K is a normal subgroup of the 
group Hi fl K. That is, Ki + \ is a normal subgroup of Ki. Now 


Ki+ 1 = Kn H^ i-i = Kn Hi n H l+ 1 = Ki n Hi+\. 


Thus, Ki/Ki + 1 = Kif(Kif]Hi + i). Hence, by the second isomorphism theorem 
(Theorem 5.2.6), we have the isomorphism 


K t /K i+1 ~ (KiH i+1 )/H i+1 . 


The quotient group (KiHi+i)/Hi+\ is commutative since it is a subgroup of 
the commutative group Hi/H l +\. Thus, Ki/Ki + \ is commutative and so K is 
solvable. 

Let / be a homomorphism of G onto a group G. Set Hi = f(Hi), i = 
0,1,... ,n. Since / is an epimorphism, f(H l+ i) is a normal subgroup of f(H l ). 
Also, Hi D Hi + 1 implies that f(Hi) D f(Hi+ 1 ). Hence, 

G = H 0 DH l DH 2 ^---2 H n -1 3 H n = {e} (8.15) 

is a subnormal series of G. We now show that f(Hi)/f(H l+ \) = Hi/Hi+i is 
commutative. Define g : Hi —> Hi/Hi+i by g(h t ) — f(hi)H l+ i. Since / is an 
epimorphism, it follows that g is an epimorphism of Hi onto Hi/Hi + Note 
that for any h i+ 1 € H i+1 C H t , g{h i+ 1 ) = /(^+i)ift+i = f {hi+\)f {Hi+\) = 
f(Hi+ 1 ). Hence, Hi+i C Ker g. Thus, g induces an epimorphism of Hi/Hi+i 
onto Hi/Hi+i. Since Hi/Hi + i is commutative, it follows that H % /Hi + \ is com¬ 
mutative. Consequently, the subnormal series (8.15) is a solvable series, proving 
that G is a solvable group. ■ 


The following corollary is immediate from Theorem 8.1.17. 

Corollary 8.1.18 If G is solvable and H is a normal subgroup of G, then H 
and G/H are solvable. ■ 

Theorem 8.1.19 Let H be a normal subgroup of a group G. If both H and 
G/H a,re solvable, then G is solvable. 
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Proof. Let 

G/H = K 0 ^K 1 DK 2 D...D F m _! DKm = {eH} = {H} 

be a solvable series for G/H. By Corollary 5.2.12, there are subgroups K % of 
G , i = 0 , 1 , ... ,m, such that K l+ 1 is a normal subgroup of K z , Ki = K t /H, 
i = 0,1,... ,m — 1, G — Ko, and H = K m . Also, Ki/K i+ 1 ~ K l /K z+ i by 
the third isomorphism theorem (Corollary 5.2.9). Since H is solvable, H has a 
solvable series, say, 


H = H 0 D D H 2 D • • O H n —i DH n = {e}. 


Thus, 


G = K 0 2 K x D • • • D K m _ i DffDffiD-O H n -i 2 H n = {e} 


is a solvable series for G. That is, G is solvable. ■ 

Theorem 8.1.20 Let G {e} be a finite solvable group. Then the factor 
groups of any composition series of G are cyclic groups of prime order. 

Proof. The proof is by induction on |(7|. If |(7| is a prime, then the theorem 
is valid since G D {e} is the only composition series for G. Hence, the theorem 
is valid for |(T| =2. Suppose the theorem is true for all groups of order < |G|, 
where \G\ > 2. If \G\ is not a prime, then G has a nontrivial normal subgroup 
H. (If G does not have a nontrivial normal subgroup, then G D {e} is a 
composition series for G so that G ~ G/ (e) is commutative. Thus, G has no 
proper subgroups. Hence, [G| is a prime, a contradiction.) By the induction 
hypothesis and Corollary 8.1.18, G/H and H have the composition series 

G/H - K 0 D Ki D K 2 D • • • D Km- 1 UKm = {e} 


and 

H = Ho D Hi D H 2 D • • O H n _ 1 D H n = {e}, 

respectively, such that each Ki/K{+\ and each Hi/H{ + i are cyclic groups of 
prime order. If we choose subgroups Ki of G corresponding to K[ as in Theorem 
8.1.19, then it follows by similar arguments that 

G = K 0 D Ki D • • • D K m - 1 D H D H 1 D • ■ • D H n _ 1 D H n = {e} 

is a composition series of G satisfying the conditions of the theorem. Thus, by 
the Jordan-Holder theorem, every composition series of G satisfies the condi¬ 
tions of the theorem. ■ ' 
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We now proceed to establish the unsolvability of S n , n > 5. We first intro¬ 
duce the notion of the commutator subgroup of a group and obtain its basic 
properties. We also give a necessary and sufficient condition for the solvability 
of a group in terms of the commutator subgroup. We then apply these results 
to show that S n , n > 5, is not solvable. 

Definition 8.1.21 Let G be a group and a, b £ G. The commutator of a 
and b is the element aba~ l b~ l . Set A — {aba~ l b ~ 1 | a,6 £ G} and let G' be 
the subgroup of G generated by A. G' is called, the derived or commutator 
subgroup ofG. 

If G is commutative, then A = {e} and so G' = {e}. Conversely, if A = G' = 
{e}, then aba~ l b~ l ~ e for all a, b £ G. Therefore, ab — ba for all a, 6 € G, 
i.e., G is commutative. Thus, G is commutative if and only if G' = {e}. 

Theorem 8.1.22 The derived subgroup G' of a, group G is a normal subgroup 
of G and GIG 1 is commutative. 

Proof. Let a, 6, g £ G. Now 

g{aba~ l b~ l )g~ l = (gag~ 1 )(gbg~ 1 )(ga ~ 1 £ _1 )(p& _1 g~ l ) = cdc~ l d ~ l , 

where c = gag~ l and d = gbg~ l . This implies that for any commutator 
aba~ 1 b~ l and for any g £ G, g(aba~ 1 b~ 1 )g ~ 1 is a commutator. From this, 
it follows that gG'g~ l C G' for all g £ G. Hence, G r is a normal subgroup of 
G. Next, we show that G/G' is commutative. Let a,b £ G. Then (ba)~ 1 ab — 
a~ l b~ l ab £ G' and so abG' = baG\ i.e., aG'bG' - bG'aGh Hence, G/G' is 
commutative. ■ 

Theorem 8.1.23 Let G' be the derived subgroup of a group G and H be a. 
subgroup of G. Then H G' if and only if H is a norm,al subgroup of G and 
G/H is commutative. 

Proof. Suppose H D G'. Let h £ H and a £ G. Then aha~ l h~ l £ G 1 C H. 
Thus, aha -1 = ( aha~ l h~ l )h £ H. Hence, H is a normal subgroup of G. 
Let us now show that G/H is commutative. To do this, let us consider 
two arbitrary elements aH, bH in G/H. Then ( aH){bH){aH)~ l {bH)~ 1 = 
aHbHa~ l Hb~ l H = aba~ l b~ l H. Since aba~ 1 b~ 1 £ G' C H, it follows that 
(aH)(bH)(aH)~ 1 (bH)~ l — H. Therefore, aHbH = bHaH , proving that G/H 
is commutative. Conversely, suppose H is normal in G and G/H is com¬ 
mutative. Let a, b £ G. Then ( aH)(bH) = ( bH)(aH ). This implies that 
a~ l b- l ab £ H. Hence, G' CH.U 
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Definition 8.1.24 Let G' be the commutator subgroup of a group G.. Set 
G^ = G 1 and define inductively 

Q( k + 1 ) — Q( k )f 

the commutator subgroup of G^ k \ k > 0. For any positive integer k, G^ is 
called the kth commutator subgroup of G. 

The following theorem gives a necessary and sufficient condition for a group 
to be solvable in terms of a commutator subgroup. 

Theorem 8.1.25 Let G be a. group. Then G is solvable if a.nd, only if there is 
a positive integer m such tha,t G= {e}. 

Proof. Suppose G = {e}. Then by Theorem 8.1.22, the chain 

G D G (1) D • ■ • D G (m_1) D G (m) = {e} 

is a solvable series. Thus, G is solvable. Conversely, suppose G is solvable. 
Then G has a solvable series, say, 

G = Ho D H, D H 2 D • ■ • D D H n = {e}. 

Since Hi+i is normal in Hi and Hi/Hi+i is commutative, we have by Theorem 
8.1.23 that the commutator subgroup Hi of Hi is contained in H 1+ \. Thus, 

Hi 3 H ' 0 = G^\ H 2 DH[D G< 2 >,..., {e} = H n 2 H' n _ , 2 G<">. 

Hence, 6' 1 "' = {e}. ■ 

Lemma 8.1.26 Let S n be the symmetric group on n symbols. If n > 5, then 
Sn ^ contains every 3 -cycle of S n for k = 1,2,_ 

Proof. Let 7r = (a b c) be any 3-cycle in S n . Since n > 5, there exist symbols 
d, / such that a, 6, c, d, / are distinct. Set a = (a b d) and j3 = (a c /). Let H 
be any subgroup of S n with the property that H contains every 3-cycle of S n - 
Then 7r ,a,/3 € H. Hence, 

{a b c) = {a b d) o (a c f) o (a d b) o (a f c) = afioT 1 ^ 1 G H', 

where H' is the derived subgroup of H. From this, it follows that Sn ^ contains 
every 3-cycle of S n . We can employ induction to obtain the desired result . ■ 

In the next theorem, we show that S n is not solvable for n > 5. 

Theorem 8.1.27 The symmetric group S n on n symbols is not solvable for 
n > 5. 
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Proof. Since S^ contains every 3-cycle of S n for k = 1 , 2, ..., there does 
not exist a positive integer m such that — {e}. Thus, by Theorem 8.1.25, 
S n is not solvable. ■ 

8.1.1 Worked-Out Exercises 

<0 Exercise 1 Let G be a group of order pgr, where p, g, r are primes and 
p > q > r. Show that G is solvable. 

Solution: The number of Sylow p-subgroups is 1 + kp, where 1 + kp divides 
qr. Suppose k 7 ^ 0. Since p > q > r, 1 + kp — qr. The number of Sylow q- 
subgroups is 1 + k'q , where 1 + k'q divides pr. Suppose k! 7 ^ 0. Since q > r, 
either 1 + k'q — p or pr. In either case, 1 + k'q > p. The number of Sylow 
r-subgroups is 1 + k"r , where 1 + k"r divides pq. Suppose k” 7 ^ 0. Then either 
1 + k"r = q or p or pq. Hence, in either case, 1 + k"r > q. Thus, G has gr(p — 1 ) 
elements of order p, at least p(g — 1 ) elements of order g, and at least g(r — 1) 
elements of order r. Since G has pgr elements, pgr > gr(p — 1)+ p(g — 1)+ 
g(r — 1) + 1. This implies that 0 > pg — p — g + 1 or 0 > (p — l)(g — 1). 
Therefore, (p — l)(g — 1) = 0, which implies that either p = 1 or g = 1, a 
contradiction. Thus, either k — 0 or k' = 0 or k" = 0. Suppose k = 0. Then G 
has a unique Sylow p-subgroup, say, H. Now H is a normal subgroup of G and 
G/H is of order qr. By Exercise 11 (page 238), we find that G/H is solvable. 
Since H is of order p, H is solvable. Hence, by Theorem 8.1.19, G is solvable. 
Similarly, if either k' = 0 or k" = 0, then G is solvable. 

0 Exercise 2 Let H 7 ^ {e} be a subgroup of a solvable group G. Prove that 
H' ± H. 

Solution: Suppose H' — H. Then — ( H')' = H' = H 7 ^ {e}. Now 
by induction, we can show that H^ — H 7 ^ {e} for any positive integer n. 
On the other hand, if is a subgroup of a solvable group and so H is solvable. 
This implies that there exists a positive integer n such that H^ — {e}, a 
contradiction. Hence, H' 7 ^ H. 

Exercise 3 Let G be the group of all nxn invertible matrices over R, n > 3. 
Show that G is not solvable. 

Solution: Let E z j be the nxn matrix whose (i,j) entry is 1 and all other 
entries are zero. Then 


E{j E r s 


E is if j — r 
0 if p 7 ^ r. 
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Now for the identity matrix / and for i 7 ^ j, I + Eij E G and (/ + = 

I — E z j. Let T be the subgroup generated by {/ + E ZJ \ i ^ j}. Since n > 3, we 
can find an integer k such that 1 < i 7 ^ k 7 ^ j <n. Now 

(I + Eik)(I + Ekj)(I + Eik) l (I + Ekj) 1 = (I + Eik)(I + .E'fcj) 

(/ - Eik)(I ~ ) 

— (-f + Ekj + E^ + Eij ) 

(/ — E k j - E lk + Eij ) 

= (/ + £*;)■ 

Therefore, (/ + E 1J ) E T 7 , proving that T C T, As a result T = T'. Thus, T is 
not solvable and so (T is not solvable. 


0 Exercise 4 Let (7L(2,R) be the group of Example 2.1.10. Prove that the 
derived subgroup of GL( 2, R) is the subgroup 


£X(2,R) 


a b 
c d 


E GL{2, R) | ad — be — 1 


Solution: Let R* be the multiplicative group of nonzero real numbers. 
Define / : GL{ 2, R) R* by 


/( 


a 


c d 


) = ad — be 


for all 


a b 
c d 


E GL{ 2, R). Now / is an epimorphism with Ker / = SL( 2, R). 
Hence, SL(2, R) is a normal subgroup of GL( 2, R) and 

GL(2, R) /SL(2, R) ~ R*. 


This implies that GL(2, R)/5X(2, R) is a commutative group and (GL( 2, R)) 

a 6 


C SL{2, R). Let us now show that SL(2, R) C (GL( 2, R)) 7 . For this, let 
E SL( 2, R). Then ad — be = 1. If c 7 ^ 0, then 


c d 


a b 
c d 


1 ^ 
0 


1 0 
c 1 


1 

0 


d—1 


c 

1 


Now for any r E R, 


1 r 


" 1 

—r 


" 1 

0 ’ 


1 r 


’ 1 

0 " 

0 1 


0 

1 


_ 0 

1 

2 


0 1 


0 

2 



" 1 

—r 


" 1 

0 " 


1 

\ — 1 

1 

-i 


"-1 

’ 1 



0 

1 


0 

1 

2 


0 1 



0 


-1 


e(GL( 2,R))' 
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and 


’ 1 0 " 


*1 o' 


P— 1 

o 


' 1 0 " 


r 1 


0 2 


r 1 


o 



1 0 

—r 1 


€ (<?L( 2 ,R))'. 


a b 
c d 


E (GL{ 2,R))'. 


Hence, 

Suppose c = 0. Then ad = 1. Thus, a ^ 0 and 


a b 
0 d 


a 0 

0 a 


s 

o 

_1 


T - 1 

o 

1_ 


a 2 a 


1 

o 

1 — 1 

_1 


0 i 

CL ^ 


1 0 


2 a 2 a 2 


1 0 



l * 

a 

o 1 


3a 3a^ 
2 _ _1_ 

3a 


3a^ 


Also, from above, 

r i si 

0 1 

€ {GL{ 2, R))'. As a result, 

a b 

c d 

Consequently, SL(2, R) = 

(GL(2,R))'. 



6 (GL(2,R))'. 

€ (GL(2,R))'. 


Exercise 5 Prove that in a group G, any refinement of a solvable series is a 
solvable series. 


Solution: Let 


G = H 0 D Hi D H 2 D ■ • • D H n -i D H n = {e} (8.16) 

be a solvable series in G and let 

G = Ho D ■ ■ ■ D Hi _1 D H D Hi D ■ • ■ D D H n = {e} (8.17) 

be a one-step refinement of (8.16). From (8.16) Hi-i/Hi is commutative. Now 
the group H/Hi is a subgroup of Hi-\f H{. Hence, H/Hi is commutative. Again 

(Hi-i/Hi)/(H/Hi) ~ Hi-i/H 

implies that Hi-i/H is commutative. Thus, (8.17) is a solvable series. Hence, 
any one-step refinement of (8.16) is a solvable series. By induction, any refine¬ 
ment of (8.16) is a solvable series. 

0 Exercise 6 Find all composition series of the group Z/ (42). Verify that 
they are equivalent. 

Solution: Now the subgroups of Z/ (42) are Z/ (42), 2Z/ (42), 3Z/ (42), 
6Z/ (42), 7Z/ (42) , 14Z/ (42) , 21Z/ (42) , and {(42)}. Hence, the composition 

Z/ (42) D 2Z/ (42) D 6Z/ (42) D {(42)} 
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Z/ (42) D 2Z/ (42) D 14Z/ (42) D {(42)} 

Z/ (42) D 3Z/ (42) D 6 Z/ (42) D {(42)} 

Z/ (42) D 3Z/ (42) D 21Z/ (42) D {(42)} 

Z/ (42) D 7Z/ (42) D 14Z/ (42) D {(42)} 

Z/ (42) D 7Z/ (42) D 21Z/ (42) D {(42)}. 

Each of the above six composition series has three factors. These factors are 
nothing but the groups Z 2 , Z 3 , and Z 7 . Hence, all these composition series are 
equivalent. 

8.1.2 Exercises 

1 . Let G be the group of symmetries of the square. Prove that the following 
series are composition series for G : 

G D { 7 * 180 , r 360, h, n} D {r 3 60, h} D { 7 * 360 } 

and 

G D {t’iso? r 3 60 ) d\, d 2 } D {r 360 ,di} D {r 360 }- 

Establish the equivalence of these composition series. Verify that { 7 * 360 , di} 
is normal in { 7 * 180 ,^ 360 ,^ 1 ,^ 2 }, but not normal in G. 

2. Find all composition series of the group Z/ ( 66 ) . Verify that they are 
equivalent. 

3. Find all composition series of Z 2 o- 

4. Write all composition series of S3, S4, A4, D4 , and Z 2 x Z 2 . 

5. Prove that every finite group has a composition series. 

6 . Let G be a commutative group. Show that G has a composition series if 
and only if G is finite. 

7. Let G be a group. Show that G' — {aia 2 • • • a n aj) 1 a ^" 1 • • • a " 1 | a.j E 
G,n> 2}. 

8 . Show that a group G is commutative if and only if G’ = {e}. 

9. Let H be a subgroup of G. Show that H' C G'. 

10. Let N be a normal subgroup of a group G such that NDG' = {e}. Show 
that 

(i) N C Z(G), 

(ii) Z(G/N) = Z(G)/N. 
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11 . Let G be a group of order pq (p, q primes). Show that G is solvable. 

12 . Let G be a group of order p 2 q (p, q primes). Show that G is solvable. 

13. Let G be a group of order p 2 q 2 (p,g primes). Show that G is solvable. 

14. Write a solvable series of S 3 x S 3 . 

15. Let G be a simple and solvable group. Show that G is commutative. 

16. Prove that a finite direct product of solvable groups is solvable. Hence, 
show that S 3 x Z is an infinite noncommutative solvable group. 

17. Let H be a normal subgroup of a group G. Prove that G has a composition 
series if and only if both H and G/H have composition series. Also, show 
that G has a composition series containing H. 

18. Prove that a finite group G is solvable if and only if H' 7 ^ H for any 
subgroup H 7 ^ {e} of G. 

19. Let G be a solvable group with a composition series. Show that G is 
finite. 

20. Prove that a group G is solvable if and only if GfZ{G) is solvable. 

21 . Let A and B be subgroups of a group G. If A and B are solvable and A 
is normal in G , prove that AB is a solvable subgroup of G. 

22 . For the following statement, write the proof if the statement is true; 
otherwise, give a counterexample. 

(i) If G 7 ^ {e} is a solvable group, then Z{G) 7 ^ {e}. 

(ii) Let G be a solvable group of order m. Then for every positive divisor 
n of m, G has a subgroup of order n. 

(iii) Every group of order 15 is solvable. 

(iv) Every solvable group has a composition series. 

(v) Every solvable series is a composition series. 

(vi) Every composition series is a solvable series. 

(vii) If two groups have equivalent composition series, then the groups 
are isomorphic. 
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8.2 Nilpotent Groups 

In this section, we study another class of groups called nilpotent groups. We 
show that the converse of Lagrange’s theorem also holds for such groups. 

Definition 8 . 2.1 A chain Go C G\ C G 2 C • ■ • Q G n of normal subgroups 
of a group G is called a central series if Gi+i/Gi C Z{G/Gf) for all i = 
0 , 1 ,... ,n — 1 . 

Definition 8 . 2.2 A group G is called nilpotent if G has a central series 


G 0 CGiCG 2 C-.-CG n 


such that Go — {e} and G n — G. 

From the definition of a nilpotent group and from the commutative property 
of Z(G/Gi ), it follows that every nilpotent group is solvable and also that every 
commutative group is nilpotent. 

Example 8.2.3 The symmetric group S 3 has only two norm,al series, 

{e} C S 3 


and 





1 2 3 

2 3 1 


1 2 3 
3 1 2 


CS 3 . 


For the first series, S 3 /{e} — S 3 ^ Z(S 3 /{e}) = {e}. 
let 


H = 


1 2 3 

2 3 1 


1 2 3 
3 1 2 


For the second, series, 


Now H/{e} % Z(So/{e}). Hence, S 3 is not a nilpotent group. However, S 3 is 
solvable. 


Finite p-groups are the most important examples of nilpotent groups. 
Theorem 8.2.4 Every finite p-group is nilpotent. 


Proof. Let G be a finite p-group. If |G| = 1, then G is nilpotent. Suppose 
|G| > 1. Then Zi = Z{G ) ^ {e} by Theorem 7.2.7. If G ^ Z lt then \G/Z Y \ > 
1 and hence by Theorem 7.2.7, \Z(GfZ\)\ > 1 . Now there exists a normal 
subgroup Z 2 of G such that Z\ C Z 2 and Z 2 fZ\ — Z{GfZ\). Thus, we have 
{e} C Z\ C Z 2 . If G 7 ^ Z 2 , we repeat the above process and obtain a normal 
subgroup Z 3 of G vsuch that ZojZ 2 — Z(G/Z 2 ) and {e} C Z\ C Z 2 C Z 3 . 
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Since G is finite, this process must terminate after a finite number of steps. 
We obtain the normal series 

{e} C Zi C Z 2 C • ■ • C Z n — G 

such that Zi+\fZi — Z{G/Zj). Hence, G is nilpotent. ■ 

For a group G, let us define Zi(G ) as follows: 

Z 0 (G) = {e}, Z 1 (G) = Z(G). 

Now Z\(G) is a normal subgroup of G and Z(G/Z\{G)) is a normal subgroup 
of GjZ\{G). Hence, there exists a unique normal subgroup Z 2 (G) of G such 
that Zi(G) C Z 2 {G) and Z 2 {G)/Zi{G) = Z{G/Zi{G)). Suppose Zi{G), i > 1, 
has been defined, i.e., Zi{G) is the normal subgroup of G such that 


Zi-i(G) C Zi{G) and Z i {G)/Z l ^ 1 {G) = Z(G/Z^G)). 

There exists a unique normal subgroup Zi+\(G) of G such that 
Zi(G) C Z i+1 (G) and Z t+l (G)/Z l (G) = Z{G/Zi{G)). 

Thus, we have the chain of normal subgroups 

{e} = Z 0 (G) C Z^G) C Z 2 {G) C • • ■ C Z n {G) C • • • 

and Zi+i(G)/Zi(G) = Z(G/Zi(G)), i > 0. This chain of normal subgroups is 

called the ascending central series of G. 

Theorem 8.2.5 Let G be a group such that Z n (G ) = G for som,e nonnegative 
integer n. Then G is nilpotent. 

Proof. We have the normal series 


{e} - Z 0 (G) C Z X {G) C Z 2 (G) C • • • C Z n (G0 = G 

such that Zi+\{G)/Zi(G) = Z(G/Zi(G)), i = 0,1, ..., n — 1. Hence, G is 
nilpotent. ■ 

Let G be a group and a. b € G. We denote by [ a.b} the commutator 
aba~ l b~ l . Let A and B be subgroups of G. We denote the subgroup gener¬ 
ated by elements [a, 6], for all a € A, b £ B, by [A, B], 

Lemma 8.2.6 Let A and B be subgroups of a group G a,nd A be normal in G. 
Then [B,G] C A if and only if ABf A C Z{G/A). 
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Proof. Suppose [B,G] C A. Then for all 6 E B, g G G, bgb l g 1 E A. 
This implies that AbAg = AgAb. Thus, Ab E Z(G/A). Let a E A and b G 
H. Then Aab = AaA6 = AAb = Ab G Z(G/A). Hence, AB/A C Z(G/A). 
Conversely, suppose AB/A C Z(G/A). Let 6 G S, g G G. Then A&g6 _1 g _1 = 
A&AgA&'^Ag' -1 = A since A£> G Z(G/A). This implies that [6,g] G A. Thus, 
[B,G]CA. ■ 

Theorem 8.2.7 Let G be a nilpotent group. Then there exists a nonnegative 
integer n such that G = Z n {G). 

Proof. Since G is nilpotent, there exists a normal series 


{e} = Go C Gi C G 2 C • • - C G n = G 

such that Gi/Gi -1 C Z(G/G*_ i), z = 1,2, ...,n, for some n. We now prove 
by induction on i that Gi C Zj(G) for all z = 0,1,.. ., n. If i — 0, then Go = 
{e} = Zo(G). Suppose that G{ C Zj(G) for some i > 0. Since G^G^ +i/G; = 
Gi+i/Gi C Z{G/Gi ), we have by Lemma 8.2.6 that [Gi+i,G] C Gj C Z z (G). 
Thus, by Lemma 8.2.6, 

Zi(G)G i+ i/Zi(G) c Z{G/Zi(G)) = Z i+1 {G)/Zi(G). 

This implies that G^+i C Zj(G)Gi+i C Z* + i(G). Hence, by induction, G* C 
Zi(G) for all z = 0, 1,... ,n. Since G n — G, Z n (G) = G. ■ 


Let G be a group. Define the subgroups G^ of G inductively as follows: 
GW - G, G^ = [G^,G], ..., GW = [Gt* -1 1,G], z > 1. It can be easily seen 
that 

G = G [1) D G [2) D G t3! D • ■ • 

is a central series. This series is called the descending central series of G. 

Theorem 8.2.8 A group G is nilpotent if and only if there exists a nonnegative 
integer n such that G^ n+1 l = {e}. 

Proof. If Gt n+1 i = {e} for some nonnegative integer n, then G has a central 
series 

{e} = G [n+l] C G [n] C • • • C G [1] = G. 

Hence, G is nilpotent. Conversely, suppose that G is nilpotent. Then there 
exists a central series 

{e} - G 0 C Gi C G 2 C • • ■ C G n = G 

of G. We now show that G^ C G n _;+i for all z = 1, 2, ..., n + 1. Clearly, G^ = 
G = G n . Suppose G^ C G n -i+\ for some z, 1 < z < n + 1. Now G;+i/G* C 
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Z(G/Gi ), i = 0,1 — 1. Therefore, by Lemma 8.2.6, [Gi+i,G] C G {, 

z = 0,1,..., n — 1. This implies that C [G n -i+i,G] C G n -i . 

Thus, by induction, C (7 n _i+i for all z = 1, 2, ..., n + 1. Consequently, 
G ln+11 CG 0 = {e}.l 

Theorem 8.2.9 Let G be a nilpotent group. Then every subgroup of G is 
nilpotent. 

Proof. Let H be a subgroup of G. There exists a positive integer n such 
that Gl n+1 ) = {e}. Now H W - H C G = G^l Suppose H® C G® for 1 < i < 
n- 1-1. Then — [ifM ; if] C [G^,(G] = G^ +1 l. Therefore, by induction, 

/ft*] C (?[ l l for all z = 1,2, ..., n + 1. Hence, C Gt n+1 ] = {e}, proving 

that H is nilpotent. ■ 

Lemma 8.2.10 Let G , if, and K be groups such that G = H x if. Then 
Zi(G) = Zi{H) x Z,(iT) /or all i = 1, 2, .... 

Proof. For z - 1, Zi(<?) = Z(G) = Z(H x K) = Z{H) x Z(K) = Z X {H) x 
Z\{K). Thus, the lemma is true for i = 1. Suppose Zi(G) — Zi(H) x Zi(K) for 
some z > 1. Now Z^+i (G) is the unique normal subgroup of G such that Z{(G) C 
Zi + i(G) and Zi+\(G)/Zi(G) = Z{G/Zi(G)). Consider the isomorphism 'ip : 

H/Zi(H) x KjZi(K) — (if x K)/Zi{H x K ). Now 

Z{G/Zi{G )) - Z((H xK)/Zi(H xK)) 

= Z((H x K)/Zi(H) x Z z {K)) (by the 

induction hypothesis) 

= Z(xh((H/Z,(H))x(K/Zi(K)))) 

= i P(Z((H/Zi(H)) x {K!Zi{K)))) 

= 4>{Z(H/Zi(H)) x Z(K/Zi(K))) 

= MlZ,+i(H)/Z,(H)) x (2 i+1 (A')/Z i (if))) 

= (Zi+i(ff) x Z i+1 (K))/(Zi(H) x Zi(if)) 

= (Z i+ i(ff) x Z i+1 (K))/Zi(H x if) 

= (Z i+1 (H) x Z i+1 (if))/Zi(G). 

Hence, Z i+ i(G) = Z t+ i(H) x Z i+ i(K). ■ 

Lemma 8.2.11 The direct product of two nilpotent groups is a nilpotent group. 

Proof. Let H and K be two nilpotent groups. Then there exists a positive 
integer n such that Z n (H ) — H and Z n (K) = K. Hence, Z n (H x K) = 

Z n (H) x Z n (K ) = H x K by Lemma 8.2.10. Thus, H x K is nilpotent. ■ 

Theorem 8.2.12 Let G,. i = 1 , 2 ,..., n, be a nilpotent group. Then O , x X 
• • • x G n is nilpotent. 
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Proof. The desired result follows by Lemma 8.2.11 and induction. ■ 

The following theorem gives several equivalent conditions of a finite group 
to be a nilpotent group. In particular, the following theorem describes all 
finite nilpotent groups in terms of p-groups. It is an analogue of the primary 
decomposition theorem for finite Abelian groups. 

Theorem 8.2.13 Let G be a finite group. Then the following conditions are 
equivalent. 

(i) G is nilpotent. 

(ii) If H is a proper subgroup of G, then H C Nq(H). 

(Hi) Every m.axim,al subgroup of G is a norm.al subgroup of G. 

(iv) Every Sylow subgroup of G is a normal subgroup of G. 

(v) G is isom,orphic to a direct product of p-groups. 

Proof. (i)=>(ii) Since G is nilpotent, G has a central series 

{e} = G 0 C Gi C G 2 C • • • C G n = G. 

Now Go C H C G — Gn. Hence, we can find an integer m ^ 0 such that 
Gm Q H , but G m + 1 £ H. Thus, there exists a E G m + 1 such that a £ H. Now 
aGm € Z{G/G m ). Therefore, for all h E H, ( aGm)(hG m ) — (hG m )(Q'G m )- 
This implies h~ 1 a~ l ha = (ah) _1 ha E Gm C H. Hence, a~ l ha E if, and so 
a -1 if a C H. Similarly, aHa~ l C H. Thus, H - a -1 (aifa -1 )a C a~ 1 Ha C H 
and so a~ 1 Ha = H. Hence a G N(H). Consequently, H ^ N(H). 

(ii) => (iii) Let H be a maximal subgroup of G. Then H C N(H) C G. Since 
H is maximal, N (if) = G. Thus, H is normal. 

(iii) =>(iv) Let P be a Sylow p-subgroup of G such that P is not normal. 
Since G is finite, there exists a maximal subgroup if of G such that N(P ) C if. 
By (iii), if is a normal subgroup of G. Let a G G. Then aPa~ l C aN{P)a~ 1 C 
aifa -1 = if. Hence, P and aPa~ l are Sylow p-subgroups of if. Thus, there 
exists h G if such that h{aPa~ l )h~ l — P. Therefore, ha G N(P) C if. This 
implies that a = hr 1 (ha) G if. Hence, G = H : a contradiction. Thus, P is a 
normal subgroup of G. 

(iv) =>(v) By Worked-Out Exercise 7 (page 208), G is a direct product of its 
Sylow p-subgroups. Since every Sylow p-subgroup is a p-group, G is a direct 
product of p-groups. 

(v) =>(i) The result here follows by Theorems 8.2.4 and 8.2.12. ■ 

We conclude this section by showing that the converse of Lagrange’s theo¬ 
rem holds for finite nilpotent groups. ? 

Theorem 8.2.14 Let G be a. nilpotent group of order m. If n > 0 and, n\m. 
then G contains a subgroup of order n. 
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Proof. If m = 1, then the result is trivially true. Suppose m > 1. There 
exist distinct prime integers p\, p 2 , • ■ ■ ,Vk such that m = p r ^ ^ ■ ■ • p£, where 
are positive integers. Let Hi be the Sylow p-subgroup for p = p z (i = 
1 , 2 ,... ,k) in G. Thus, by Theorem 8.2.13 and Worked-Out Exercise 7 (page 
208), G = H i x H 2 x ■ • • x Hk. Since n|m, there exist integers • Uk such 

that n = p ^ 1 p^ 2 ■ ■ ■ p^ fc . Now \H{\ = p )P and so by Theorem 7.3.1, Hi contains 
a subgroup Ai of order p- { for i = 1, 2,..., k. Thus, B = A\ x A 2 x ■ - • x Ak is 
a subgroup of G of order n. I 

8.2.1 Worked-Out Exercises 

0 Exercise 1 Find a central series Go C G\ C ■ • • C G n in D 4 such that 
Go — {e} and G n = D 4 . 

Solution: D 4 = (a, b ) such that o(a) = 4 , 0(6) = 2 , and ba = a 3 b. Now 

{c} — Go G G\ — {e, u 2 )- C G 2 — {c, a, g. 2 , g 3 } C G n = D 4 

is a normal series in D 4 . Since \D 4 /G\\ = 4 and I.D4/G2I = 2, it follows 
that D4/G 1 and D4/G2 are commutative groups. Thus, G2/G 1 C D\JG\ = 
Z{D 4 /Gi) and D 4 /G 2 C Z(D 4 /G 2 ) = D4/G2 . Since Z(D 4 ) = {e,a 2 } = Gi, it 
follows that G1/G0 Q Z(D 4 /Go). Hence, {e} C {e,a 2 } C {e,a,a 2 ,a 3 } C L> 4 is 
a central series. 

<> Exercise 2 Give an example of a group G such that G is not nilpotent, but 
G contains a normal subgroup H such that H and G/H are nilpotent. 

Solution: The symmetric group S3 is not nilpotent. Now A3 is a normal 
subgroup. Since \A3\ = 3 , A3 is commutative and hence nilpotent. Also, 
IS3/A3I = 2 . Thus, S3/A3 is commutative and so is nilpotent. 

8.2.2 Exercises 

1 . Prove that a homomorphic image of a nilpotent group is nilpotent. 

2 . Prove that a group of order 65 is nilpotent. 

3 . Show that D n is nilpotent if and only if n — 2 m for some positive integer 

m. 

4 . Find ascending central series for S3 and S3 x z 2 . 

5 . Is S3 x S3 nilpotent? 
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Camille Jordan (1838-1921) was 
born on January 5, 1838 in Lyons, France, 
into a well-to-do family. At the age of seven- 
teen he entered the Ecole Polytechnique to 
become an engineer. During his time as an 
engineer, he had ample opportunity to carry 
out his mathematical research and to write 
most of his 120 papers. He retired as an 
engineer in 1885. From 1873 until 1912 he 
taught at the Ecole Polytechnique and the 
College de France. 

Jordan was a universal mathematician. 

He published papers in all branches of math¬ 
ematics of his time. In analysis, he orig¬ 
inated the concept of a bounded function. 

In topology, he showed that a plane can be decomposed into two regions by a simple 
closed curve. 

Primarily, Jordan was an algebraist. He became famous at the age of 30 and for 
the next 40 years he was considered the master of group theory. He was the first to 
develop the theory of finite groups and its applications in the direction of Galois. He 
originated the concept of composition series and proved the first half of the famous 
Jordan-Holder theorem. He studied solvable groups in a very general sense. In 1870, he 
collected all his results on permutation groups for the previous ten years in Traite des 
substitutions. His Traite des substitutions became a bible in all areas of group theory. 
Jordan’s deepest results in algebra were his finiteness theorems. He was joined by 
Felix Klein and Sophus Lie in the study of groups of movements in three-dimensional 
space. 

His Course d’analyse , published in the early 1880s, had a great influence on math¬ 
ematics and set the standard for rigor. In this book, he showed how multiple integrals 
can be evaluated by successive integrations. 

In his study of solvable groups, he made extensive use of concepts such as normal 
subgroup, homomorphic images of a group, and quotient groups. He was the first one 
to use the term “simple group.” 

He died on January 22, 1921. 
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Otto Ludwig Holder (1859-1937) 
was born on December 22, 1859, in Stutt¬ 
gart, Germany. His father was a professor of 
French. He received his early education in 
Stuttgart. On a colleague’s suggestion, his 
father sent him to Berlin in 1877. At that 
time, Weierstrass, Kronecker and Rummer 
were teaching there. 

In his dissertation, presented in 1882, 
Holder developed the continuity condition 
for volume density that bears his name. He 
gave the first complete general proof of Weier- 
strass’s theorem and also examined the con¬ 
vergence of the Fourier series of a function, which was not assumed to be either con¬ 
tinuous or bounded. 

After receiving his doctorate, Holder attended Kronecker’s and Klein’s seminar and 
became interested in group theory. He completed the proof of the so-called Jordan- 
Holder theorem on composition series by showing the uniqueness of the factor group, 
which is now a fundamental concept in group theory. He also studied simple groups. 
Other than the known simple groups of order 60 and 168, he showed that there is no 
other simple group of composite order less that 200. He also investigated the structure 
of groups of orders p 3 , pq 2 , pqr, p 4 , and tl, where p,q,r are primes and n is a square 
free integer. He also worked on geometry and number theory. Holder died on August 
29, 1937. 
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Chapter 9 

Finitely Generated Abelian 
Groups 


The second source in the evolution of group theory, namely, number theory, 
led to the specialized theory of Abelian groups. 

In this chapter, we determine the structural properties of finite Abelian 
groups and finitely generated Abelian groups. In Section 4.2, it was shown 
that every cyclic group is Abelian. In Section 5.1, it was proved that any two 
finite cyclic groups of the same order are isomorphic and thus for any positive 
integer n, Z n is the only cyclic group of order n (up to isomorphism). That 
an infinite cyclic group is isomorphic to Z was shown in Section 5.1. Hence, 
all cyclic groups have been determined. In this chapter, it is proved that 
any finitely generated (and hence any finite) Abelian group can be expressed 
as a direct sum of cyclic groups. Thus, the structural properties of a finitely 
generated (finite) Abelian group can be determined from those of cyclic groups. 

In this chapter, we use additive notation for the group operation. 0 will 
denote the identity element and —a will denote the inverse of an element a. 
The direct product (internal or external) G x H of groups (subgroups) will be 
written as G © H and called the direct sum of G and H. 

Let G be an Abelian group. By Theorem 6.1.4, G is the direct sum of 
subgroups G\, (?2> • • -, G n if and only if 

(i) G = G\ + G 2 + ■ • • + G n (i.e., for all <7 £ G, g = g\ 4- <72 + ■ • ■ + 9 n for 
some gi G Gi, i = 1 , 2,. .., n) and 

(ii) G{ n (Gn T ■ ■ ■ T Gi —1 -r GiJ-\ T ■ ■ ■ T Gn) = {0} for all i — 1, 2,..., tl. 

If G is a direct sum of subgroups G\, G^, -. -, G n , then we write 

G — G\ © Go © • • • © G n . 

If G = G\ ©G r 2 ©- ■ ■®G f n and Gi ~ Hi, where Hi is a group, i = 1,2,... ,n, 
then 

G ~ Hi 0 H 2 © • ■ ■ 0 H n . 
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9.1 Finite Abelian Groups 

Given a positive integer n, the cyclic groups of order n have been completely 
determined. We can determine the subgroups, homomorphic images, and gen¬ 
erators of such groups. Now every cyclic group is Abelian, but not conversely. 

Given any positive integer n, what can we say about an Abelian group of order 
n? How many different Abelian groups of a given order are there? What can 
we say about the subgroups of such groups? In this section, we attempt to 
answer such questions. The main theorem of this section is that every finite 
Abelian group is a finite direct sum of a finite number of cyclic p-groups. We 
will use this theorem to answer some of the above questions. We begin with 
the following definition. 

Let G be an Abelian group and A be a subgroup of G. Then A is called a 
direct summand of G if there exists a subgroup B of G such that 

G = A © B. 

We leave the proof of the following theorem as an exercise. 

Theorem 9.1.1 Let G be an Abelian group. Let r £ Z and p be a prim,e. 

(i) Let G[r] — {g £ G \ rg = 0}. Then G[r} is a subgroup of G. 

(ii) Let rG = {rg | g £ G}. Then rG is a subgroup of G. 

(in) Let G(p) = {g £ G \ g is of order p s for some s > 0}. Then G{p) is a 
subgroup of G. 

(iv) G/G[r\ ~ rG. M 

Definition 9.1.2 The subgroup G(p) of Theorem, 9.1.1 is called a p-primary 
component of G. 

Let G be a finite Abelian group of order p l for some l £ N. Since the order 
of each element of G divides the order of G, the order of each element is p T for 
some r, 0 < r < 1. Therefore, there exists q€G such that o(a) > o(6) for all 
6 £ G. Hence, the corresponding cyclic subgroup (a) is of maximal order in G. 

In the next theorem, we show that (a) is a direct summand of G. 

Theorem 9.1.3 Let G be a finite Abelian group of order p l for som,e l £ N, 
p a prim,e. Let a £ G be such that o (a) = p k is the largest in G. Then (a) 
is a direct summ,and of G, i.e., there exists a subgroup B of G such that G = 

(a) © B. 

Proof. Let 0 t - x £ G. Since |G| = p l , o(x) = p l for some positive integer t. 

Also, o(a) > o(x) and so t < k. Therefore, p k x = 0 for all x £ G. Let 

C = {B | B is a subgroup of G and (a) D B = {0}}. 
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Since {0} G C, C Also, C contains only a finite number of subgroups. 

Hence, C has a maximal element, say, B. We show that G = (a) ©jB. Suppose 
there exists g E G such that g £ (a) 0 B. Since p k g = 0 E (a) © B, there exists 
a positive integer s such that p s g E (a) © B. Let n be the smallest positive 
integer such that p n g E {a) © B, i.e., p n g E (a) © B , but p n ~ 1 g ^ (a) © B. 
Write d = p n ~ 1 g. Then d ^ (a) © B and pd E (a) © B. Now pd = ta + 6 
for some t E Z and b E B. Therefore, 0 = p k ~ 1 pd = p k ~ l ta + p k ~ l b. Thus, 
p k ~ l ta = ~p k ~ l b E (a) D B and so p k ~ 1 ta = 0. Then o(a) = p k must divide 
p k ~ l t and so p\t. Let t — pr and a' — ra E (a) . Then pd = pa' + 6 or p(d — a') = 
& E B. Write x = d — a'. Then x = d — a' — d — ra ^ B and this shows 


that (a) D {B,x) ^ {0}^Hence, there exist m,s E Z and b\ E B such that 
0 ^ m,a = &i + sx. If gcd(p, s) ^ 1, then s = pq for some q E Z. Since px E B, 
ma = &i +q(px ) E B, which contradicts the fact that (a) fl B = {0}. Therefore, 
gcd(p, s) — 1, which implies that there exist u,v E Z such that 1 = us + vp. 
Thus, x — u(sx)+v(px ) = u{rna— 6 1 )+u(px) = «ma+(-u6i©u(pa:)) E (a)©H, 
i.e., d — a' = x E (a) © B. But then d — d — o! F a! E (a) © B, which is a 
contradiction since d (a) © B. Hence, G = (a) © B. ■ 


Example 9.1.4 Let G be a noncyclic group of order p 2 . Since \G\ —p 2 , G is 
Abelian. By Cauchy’s theorem, there exists a E G such that o(a) = p. Since G 
is not cyclic, G does not contain any elem.ent of order p 1 . Therefore, o(a) is 
the largest in G. Thus, there exists a subgroup B of G such that 


G — (a) © B. 


Since \G\ = |{a)| • |J3|, it follows that \B\ = p. This shows that B is a cyclic 
group of order p and (a) ~ Z p cx B. Hence, 

G — Z p © Z p. 

In the next theorem, we prove that any nontrivial Abelian p-group can be 
expressed uniquely as a direct sum of nontrivial cyclic p-groups. 


Theorem 9.1.5 Let G be a finite Abelian p-group, p a prime. Then G is 
a direct sum. of cyclic p-groups. Furth.erm.ore, if G = G\ © G 2 © ■ * • © G r 
= H\ © H 2 © • • • © H s , where Gi and Hj are cyclic p-groups, |Gi| > |d? 2 1 > 

■ • • > \G r \ > 1, and \H\\ > \H- 2 \ > • • • > \H S \ > 1, then r = s and Gi ~ Hi , 
1 < i < r. 


Proof. Let |G| = p n . We prove the result by induction on n. If n = 1, then 
G is a cyclic group of order p and so in this case the result is trivially true. 
Suppose the result is true for all p-groups of order less than the order of G. 
Let a E G be such that o(a) is the largest in G. Then by Theorem 9.1.3, there 
exists a subgroup B of G such that G = (a) © B. Now B is a p-group and 
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\B\ < |G|. Therefore, by the induction hypothesis, B is a direct sum of cyclic 
p-groups and therefore G is a direct sum of cyclic p-groups. We now prove the 
uniqueness part. 

We first note that G\p] and G{\p] are subgroups of G and Gi, respectively. 
Let a € G\p\. Then a = a\ + 02 + ■ • ■ + a T for some a* G Gi, 1 < i < r. Now 
pai + pa 2 + ■ ■ - + pa r = pa = 0. Hence, pai = 0 for all 1 < i < r. Thus, a; € 
Gi\p\ for all 1 < i < r. Therefore, G\p] = G\ [p] ® G 2 [p] © • • ■ © G r [p]. Since G z [p] 
is a cyclic group such that every nonidentity element is of order p, |G 2 [p]| = p 
for all 1 < i < r. Thus, 

\G\p]\ = |C?i[p]| |G 2 [p]| • • ■ |G r [p]| = p r . 

By a similar argument, \G[p]\ = p s since G — H\ © H 2 © ■ ■ •© H s . Thus, p r = p s 
and so r — s. Now since cyclic groups of the same order are isomorphic, in order 
to show that Gi ~ Hi, 1 < i < r, it suffices to show that \Gi\ — \Hi\, l < i < r. 
We prove this by induction on n. If n = 1, then the result is trivially true. 
Suppose that the result is true for all p-groups of order less than p n , where 
n > 1 . By Theorem 9 . 1 . 1 (iv), Gi/Gi\p\ ~ pGi- Since Gi is cyclic, Gi\p] is cyclic. 
Also, since every nonidentity element of Gi\p] is of order p, |Gj[p]| = p. Thus, 
\pGi\ = < |Gi|. This implies that pG z = {0} if and only if \Gi\ — p. Now 

if pGi ~ -fo}, then pGi = {0} for alH < l < r. Thus, pG = pG\ 0 • • • © pG m , 
where m < r, pGi { 0 }, 1 < i < m, and pGi = { 0 }, m +' 1 < l < r. Similarly, 
pG = pH\ 0 • • • 0 pH t , where t < r, pHi ^ {0}, 1 < l < t and pHi = {0} for 
all t + 1 < l < r. Since \pG\ < \G\, m — t and \pGi\ — \pH l \ for all 1 < i < m, 
by the induction hypothesis, and therefore |G Z | = \H 1 \ for all 1 < i < m. 
Also, \Gi\ = p = \Hi\ for all m + 1 < i < r. Consequently, \G Z \ = \Hi\ for all 
1 < z < r. ■ 

Example 9.1.6 Let G be an Abelian group of order 8 . Since 8 = 2 3 , G is a, 
2 -group. There exists a £ G such that o(a) is the largest in G. By Ca.uchy’s 
theorem, G has a.n el.em.ent of order 2. Thus, o (a) > 2 and so o(a) = 2 , 4 or 
8 . If o (a) = 8 , then G ~ Zg. If o(a) = 4, then G ~ Z 4 0 Z 2 . Now; suppose that 
o(a) = 2. Bp Theorem. 9.1.3, t/zere exists a subgroup B of G such that 

G = (a) 0 B. 

Then \B\ — 4 = 2~, proving that B is a 2-group. Since o(a) zs /Ae largest in G , 
B has no el.em.ent of order 4. 77ms, B ~ Z 2 0 Z 2 . Hence, 

G ~ Z 2 © Z 2 0 Z 2 . 


Acm Zg /?.as an element of order 8 , Z 4 0 Z 2 /?-as no el.em.ent of order 8 , but has 
an element of order 4 and Z 2 0 Z 2 0 Z 2 has no el.em.ent of order 4 or 8 . Thus, 
Zg., Z 4 0 Z 2 and Z 2 0 Z 2 © Z 2 are nonisom,orphic groups. Hence, there are 
exactly three (up to isomorphism.) Abelian groups of order 8 . 
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The next theorem is called the fundamental theorem of finite Abelian 
groups. 

Theorem 9.1.7 Let G be a finite Abelian group. Then G is a direct sum, of 
cyclic p-groups. Furthermore, any two decompositions of G as a, direct sum, 
of nontrivial cyclic p-groups are the same except for the order in which the 
summands are arranged. 

Proof. If |G| = 1, then the result follows easily. We now assume that 
|G| > 1. Let \G\ = p^pj 2 ■ • -P?', where the pfi s are distinct primes and the 
nfi s are positive integers. By Theorem 7.3.5, G has a Sylow p^-subgroup, say, 
Gi for alH = 1, 2,..., l. Since G is Abelian, Gi is a normal subgroup of G and 
hence Gi is unique for alH = 1,2,..., l. From Worked-Out Exercise 7 (page 
208), it follows that G is the internal direct sum of Gi, i = 1,2,...,/. However, 
since we are using additive notation, we give details of the proof for the sake 
of completeness. 

Now |C?i| = p n fi for alH = 1,2, ...,/. Hence, Gi D G ; = {0} for all i j. 
We now show that 

Gi D (Gi + ■ ■ • + Gi_i + Gi+i + • • • + Gi) = {0} 

for alH = 1,2, ..., /. Suppose a E GiC l (Gi + ■ • • T Gi_i + Gi+\ + • • • + G/). 
Then a E Gi and a E G\ + ■ • • + Gi_i + Gi+\ + • • • + G{. Hence, 


a = a\ -t- • • • + fli-i + Oi+i + • * ‘ + a h 


where the a 3 E Gj. Now for all j 7 ^ i, o (aj) = p Tj for some r jt 0 < Tj < nj. Let 


r — rE 1 ■ ‘ • T) ri ~ 1 'n ri+1 ■ • • rfi 1 

r — Pi Pi-i Pi+i Pi 


Then ra = 0. Thus, o(a) divides r. Since a E Gi , o(a) divides pf l . But r and 
p are relatively prime. Therefore, o(a) = 1. This implies that a = 0. Hence, 

Gi n (Gi + * • • -r Gi_i + Gi+i + • • • + Gi) — {0}. 


From this, it follows that 

|Gi H-1- G;| = |Gi| • • • |G/| = P\ l p2 2 ’ ‘ ‘Pi 1 — 1^1 ■ 


Thus, 

G = G\ © G ’2 © • • • © G/. 

Now each Gi is an Abelian p-group. Hence, by Theorem 9.1.5, G z is a direct 
sum of cyclic p-groups, whence G is a direct sum of cyclic p-groups. 

We now prove the uniqueness of the direct summands. We prove the result 
by induction on Z, the number of distinct primes in the factorization of |G|. II 
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/ = 1, then G is a p-group and the result is true by Theorem 9 . 1 . 5 . Suppose 
the result is true for all nonzero finite Abelian groups H such that the number 
of distinct primes in the factorization of \H\ is less than l. 

Let 

G — G\ © G2 © ■ • ■ © G r — H\ © H2 © ■ ■ • © Ht 

be two decompositions of G as a direct sum of nontrivial cyclic p-groups. Since 
for groups A ©B ~ £© A, we may assume by rearranging if necessary that the 
summands Gi,G 2 ,..., and Hi, H2,.. ■, H s (m < r, s < t) are the cyclic p- 
groups for the prime pi, the groups G m + 1,..., G r and H s+ \,..., Ht are cyclic 
p-groups for the primes p different from pi, |C?i| > |C?21 > > |C? m | , and 

\H\ | > \H 2 \ > ■ • ■ > | H s |. Let A = Gi © G 2 © • • • © G m , B = H\ © H2 © • ■ • ©Lf 5 , 
C = Gfji+i © • • • © G t , and D = H s +\ © • • • © H^. Then 

G = A®C = B®D. 

We now show that A = B. First note that the order of a nonzero element of 
A and the order of a nonzero element of C are relatively prime. Similarly, the 
order of a nonzero element of B and the order of a nonzero element of D are 
relatively prime. Let a £ A, a ^ 0 . Then a £ G = B © D. Thus, a = b + d for 
some b e B and D. If a — 5 ^ 0 , then the order of a — 6 is some positive 
multiple of pi whereas the order of d is different from any positive multiple of 
Pi. Therefore, we have a contradiction and so a — 6 = 0 ora = 6e_B. This 
implies that A C B. Similarly, B C A and so A = B. A similar argument 
shows that C — D. Now A = B is a p-group and hence by Theorem 9 . 1 . 5 , 
m = s and Gi — Hi , i — 1,2,...,m. Now C = D is an Abelian group of 
order p£ 2 • • • p” z - Hence, by the induction hypothesis, it follows that the two 
decompositions G m+ 1 © • • • © G r and H s+ 1 © • • • © H t of the group C are the 
same except for the order in which the summands are arranged. Consequently, 
the above two decompositions of G are also the same except for the order in 
which the summands are arranged. ■ 

From Theorem 9 . 1 . 7 , it follows that for any finite Abelian group G 7^ {0} 
there is a list of positive integers Pi 1 ,P 2 2 i ■ • • 5 Pk k > which are unique except 
for their order, where pi,P2, • • • ,Pjt are primes (not necessarily distinct) and 
ni, n 2 , ..., nfc are positive integers such that 

G — Z »i © Z »2 © * • • © Z . 

Pi P2 Pfc 

The numbers p^ 1 ,P2 2 , • • • ,p£ fc are called the elementary divisors of G. 
Example 9 . 1.8 Let G be the group Z4 © Zq © Zg. Now 

G — Z22 © Z3 © Z2 © Zg2 — Z2 © Z22 © Z3 © Z32. 

Hence, the elementary divisors of G are 2, 2 2 , 3 , 3 2 . 
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In Section 7 . 2 , we proved by using Cauchy’s theorem that the converse of 
Lagrange’s theorem holds for finite Abelian groups. Next, we prove the same 
result by using the results developed in this chapter. 

Corollary 9.1.9 If G is a finite Abelian group of order n and m is a positive 
divisor of n, then G has a subgroup of order m. 

Proof. If n = 1 , then m — 1 and {e} is the subgroup of order m. Suppose 

n > 1 . By Theorem 9 . 1 . 7 , there exist prime integers pi,p 2 , • ■ • ,Pk and positive 

integers n\, n 2 ,..., n* such that G ~ Z r p 1 0 Z » 2 0 • • • © Z ,n fc . This implies 

Pi V 2 P k 

that n = Pi'p ™ 2 • • • Since m|n, there exist integers 0 < < n 4 , i = 1 , 2 , 

... ,k such that m = p^p ™ 2 ■ ■ -p™ fe . Since p ™ 1 \ p™' for all i , by Theorem 4 . 2 . 10 , 
The cyclic group Z p ^i has a unique subgroup Gi of order p ™ 1 for all i. Thus, 
G' A G2 + • ■ • + Gk — G\ 0 ( 7 2 0 • ■ • 0 Gk is a subgroup of Zn 0 Z ^2 0 • • • © Z 

Pi p 2 p k 

of order p™ x p ™ 2 ■ ■ -p™ k = m. From this, it follows that G has a subgroup of 
order m. ■ 


Let G be a finite Abelian group of order n = Pi ' 1 ?'? 2 '' * p£ fc > where the 
pf s are distinct primes and the nfs are nonnegative integers. Consider the 
subgroup Gi (as defined in the proof of Theorem 9 . 1 . 7 ). Now \Gf\ — pT. From 
this, it follows that Gi C G{pi). Thus, \G(pi)\ > pf 1 . Since G{pf) is a p t -group, 
j( 7 (pi)| = p\ for some integer t. Hence, t > m. Suppose t > n*. By Lagrange’s 
theorem, \G(pi)\ divides |( 7 | . This implies that p\\ p^p ^ 2 • • -p£ fc , which in turn 
implies that p* _7V | p ™ 1 ■ ■ ■ p^Nfp^f ^ 1 • ■ -p^ fc , a contradiction, since the pf s are 
distinct primes. Hence, t = and so Gi = Gipf). From this, we conclude that 
G is a direct sum of its p-primary components. 

Consider the cyclic group Z n . There exist distinct primes pi,p 2 ,... ,pfc and 
positive integers 711,712,... , n*, such that n = p^p ™ 2 ■ ■ -p]! fc . For p = Pi, the 
p-primary component of Z n is Z . Hence, it follows that 

^ i 

Z n — Z nj 0 Z n 2 0 ■ • • 0 Z n k . 

Pi P 2 Pfc 


Example 9.1.10 (i) Let G = Z12. -/Vou> 12 = 2 2 • 3 and so by the previous 
paragraph, G ~ Z 2 2 0 Z3 = Z4 0 Z3. Aon; ( 7 ( 2 ) ~ Z4 and ( 7 ( 3 ) ~ Z3. Hence, 
the primary components are Z4 and Z3. 

(ii) Let G = Z12 0 Zi8 0 Z0o- Nora 12 = 2 2 • 3 , 18 = 2 • 3 2 , and 60 — 2 2 • 3 ■ 5 . 
Thus, 

G = Z12 © Z18 0 Zgo 

— (Z4 © Z3) 0 (Z 2 0 Z9) 0 (Z 4 0 Z3 0 Z5) 

— (Z4 0 Z4 0 Z 2 ) 0 (Z9 0 Z3 0 Z3) 0 Z5. 

This implies that G( 2 ) ~ Z4 0 Z4 © Z 2 , ( 7 ( 3 ) ~ Z9 0 Z3 0 Z3, and ( 7 ( 5 ) ~ Z 5 . 
Hence, the primary components are Z4 0 Z4 0 Z 2 , Z9 0 Z3 0 Z3, and Z5. 
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Definition 9.1.11 Let G be a finite Abelian p-group of order p n (n > 0). 
If G = G i ® G2 © • • * © Gk, where each Gi is a cyclic group of order p ni 
with n\ > 77,2 > ■ • • > n/. > 0, then the integers n\, n %,.. ., Uk are called the 
invariants of G and the k-tuple (n\,n 2, ... ,nk) is called the type of G. 

We know that any two cyclic groups of the same order are isomorphic. 
However, this result does not hold for Abelian groups. For example, Z 8 and 
Z 4 © Z 4 are nonisomorphic Abelian groups of order 8 = 2 3 . In the next theorem, 
we obtain a necessary and sufficient condition for two finite Abelian p-groups 
of the same order to be isomorphic. 

Theorem 9.1.12 Two Abelian p-groups of order p n (n > 0) are isorrwrphic if 
and only if they hare the same invariants. 


Proof. Let G and H be two Abelian p-groups of order p n (n > 0 ). Suppose G 
and H have the same invariants nj, 712, ..., n^, where n\ > 77,2 > ■ • • > njt > 0 . 
Then G = G\ © G2 © • • • © Gk, where each Gi is a cyclic group of order 

p ni , 1 < i < k, and H — H\ © H2 © • ■ ■ © Hk, where each Hi is a cyclic 

group of order p ni , 1 < i < k. Since cyclic groups of the same order are 
isomorphic, Gi cs Hi, 1 < i < k. Hence, G ~ H. Conversely, suppose G ~ H. 
Let G = G\ © G2 © ■ • • © Gk, where each Gi is a cyclic group of order p Ui , 

1 < i < k, n\ > n2 > • • • > n k > 0 , and H = H\ 0 H2 © ■ • • 0 Ht, where 

each Hj is a cyclic group of order p r J, 1 < j < t, r\ > r 2 > • ■ • > rt > 0 . Let 
f : G H be an isomorphism of groups. Then / _ 1 (i/^) is a cyclic subgroup 
of G of order p Ti and also G = / - 1 (ifi) © 1 (^2) © • • • © f~ l {H t ). Hence, by 

Theorem 9 . 1 . 5 , it follows that t = k and p Ti = \f~ 1 (Hi)\ = p ni , 1 < i < k. ■ 

Example 9.1.13 Z 4 ©Z 2 and Z 2 ©Z 2 ©Z 2 are 2 -groups of order 2 3 . Now the 
invariants of Z 4 © Z 2 are 2 , 1 and the invariants of Z 2 © Z 2 © Z 2 are 1 , 1 , 1 . 
Hence, Z 4 © Z 2 and Z 2 © Z 2 © Z 2 are nonisomorphic groups. 


Let n be a positive integer. A partition of n is an s-tuple (ni, 77,2,..., n 3 ) 
of positive integers such that n — n\ + 77.2 + • • • + n s and n\ > n2 > • ■ ■ > n s . 

We find that any finite Abelian p-group G of order p n (n > 0) can be 
decomposed uniquely as G = G\ © G2 © ■ • • © Gk, where each Gi is a cyclic 
group of order p Ui , 1 < i < fc. and n\ > ri2 > ■ • • > nk > 0 . It is also true 
that n = ni + n-2 + • • ■ + n*;. Therefore, ni,n2,...,n* determine a partition of 
n. Next, let n = ni + 77,2 + •••+ 77.^, where each n ? - is a positive integer and 
n\ > n2 > • • • > nk- Then G = Z p ^i © Z p n 2 © • • • © Z p ^. k is an Abelian p-group 
of order p n i+ n2H tn fc _ pn guc ^ that the invariants of G are ni,n2,..., n^. 
It now follows that the number of nonisomorphic Abelian p-groupsof order p n 
(n > 0) is equal to the number of partitions of n. 
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Example 9.1.14 Let p = 2 and n = 4. In this example, we want to describe 
all Abelian groups of order 2 4 . Now 1 + 1 + 1 + 1 , 2 + 1 + 1 , 3+1, 2 + 2, and 
4 are all the partitions of 4. Thus, there are five nonisomorphic Abelian groups 
of order 2 4 . They are 

Zi6 

Zg © Z2 
Z4 © Z4 
Z4 © Z2 © Z2 
Z2 © Z2 © Z2 © Z 2 - 

9.1.1 Worked-Out Exercises 

0 Exercise 1 Describe all Abelian groups of order 2 5 . 

Solution: 5=1+1+1+1+1=2+1+1+1=3+1+1=4+1= 
3 + 2 = 2 + 2 + l. Thus, there are seven partitions of 5 and so there exist seven 
nonisomorphic 2-groups of order 2 5 . They are 

Z32 

Z16 © Z2 

Zg © Z4 

Zg © Z2 © Z2 

Z4 © Z4 © Z2 

Z4 © Z2 © Z2 © Z2 

Z2 © Z2 © Z2 © Z2 © Z2. 

0 Exercise 2 Find all Abelian groups of order 20. 

Solution: Let G be an Abelian group of order 20. Now 20 = 2 2 • 5. By 
Theorem 7.3.5, G has a Sylow 5-subgroup, say, G( 5) and a Sylow 2-subgroup, 
say, (7(2). Since G is Abelian, (7(2) and (7(5) are normal subgroups of G and 
hence are unique. Now (7(2) n (7(5) = {0}. This implies that |(7(2) + (7(5)| = 

|(7(2)| • |(7(5)| = 4 • 5 = 20. Thus, G = (7(2) + (7(5). Hence, G = (7(2) © (7(5). 
Now G( 5) ~ Z 5 . Since |(7(2)| = 4 = 2 2 , either (7(2) ~ Z 4 or G{ 2) ~ Z 2 © Z 2 . 
Therefore, either (7 ~ Z 5 0 Z 4 or G ~ Z 5 © Z 2 © Z 2 . Thus, there are two 
Abelian groups of order 20 (up to isomorphism). 

<0 Exercise 3 Find all Abelian groups of order 63, which contain an element 
of order 21. 

Solution: Let G be an Abelian group of order 63 = 3 2 ■ 7. Then (7 = 
(7(3) ©(7(7), where (7(3) is a 3-group of order 3 2 and (7(7) is a 7-group of order 
7. Now 2=1 + 1 shows that either (7(3) ~ Z32 or G( 3) ~ Z3 © Z3. Hence, 
Zg © Z7 and Z3 © Z3 © Z 7 are the only two nonisomorphic Abelian groups of 
order 63. Now in Z 9 ©Z 7 , ([3], [1]) is an element of order 21 and in Z 3 ©Z 3 ©Z 7 , 
([ 0 ], [ 1 ], [ 1 ]) is an element of order 21 . 
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0 Exercise 4 Find all Abelian groups of order 360. 

Solution: Let G be an Abelian group of order 360 = 2 3 • 3 2 • 5. Now G has 
a unique Sylow 2-subgroup, say, G{ 2), a unique Sylow 3-subgroup, say, (7(3), 
and a unique Sylow 5-subgroup, say, (7(5). Thus, G = (7( 2 ) © (7(3) ® G(5) and 
|G(2)| - 2 3 , |(7(3)| = 3 2 , and |<7(5)| = 5. Now 3 = 1 + 1 + 1 = 2 + 1 and so 
there are three partitions of 3. This implies that there are three nonisomorphic 
Abelian groups of order 2 3 . Hence, 

(7(2) — Zg or (7(2) — Z 4 ® Z 2 or (7(2) ~ Z 2 © Z 2 © Z 2 . 

Similarly, since 2 = 1 + 1 , there are two partitions of 2. Therefore, 

either (7(3) ~ Z 9 or (7(3) ~ Z 3 © Z 3 . 


Since |(7(5)| — 5, 

( 7 ( 5 ) ~ Zg. 

Hence, (7 is isomorphic to one of the following groups 

Zg © Z9 © Z5 

Z4 © Z2 © Z9 © Z5 

Z2 © Z2 © Z2 © Z9 © Z5 

Z§ © Z3 © Z3 © Z5 

Z4 © Z2 © Z3 © Z3 © Z5 

Z2 © Z2 © Z2 © Z3 © Z3 © Z5. 

None of these groups is isomorphic to each other. Consequently, there are six 
Abelian groups of order 360 (up to isomorphism). 


0 Exercise 5 Find the elementary divisors of the group Z 20 © Z 8 © Z 50 * 


Solution: Let (7 = Z20 © Zg © Z50. Then 

(7 = Z20 © Zg © Z50 

— (Z5 © Z4) © Zs © (Z25 © Z2) 
— Z5 © Z22 © Z23 © Z52 © Z2 
— Z2 © Z22 © Z23 © Z5 © Z52. 

Hence, the elementary divisors are 2,2 2 ,2 3 , 5, 5 2 . 


<)> Exercise 6 Let G and H be finite Abelian groups. 

(i) Let / : (7 —* H be a homomorphism. Show that f(G(p)) C H(p) for 
all primes p. 

(ii) Prove that G ~ H if and only if G(p) ~ H(p ) for all primes p. 

~Pu*uc- 
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Solution: (i) Let a G G(p). Then p k a — 0 for some k > 0. Thus, 0 = 
f(p k a) =p k f(a). Hence, f(a ) G H(p). Thus, /(G(p)) C 77 (p). 

(ii) Suppose G — H and let / : G —> H be the isomorphism of G onto 
H. Let p be a prime and a — /|g(p)> be., a the restriction of / to G(p). 
By (i), a : G(p) ■—» 77 (p). Clearly a is a monomorphism. Let h G 77(p). 
There exists a G G such that /(a) = h. Also, p k h = 0 for some k > 0. 
This implies that f(p k a ) = p k f(a ) = p k h — 0, which in turn implies that 
p k a = 0 since / is one-one. Hence, a G G(p) and so h = f (a) = a(a). Thus, a 
is an isomorphism of G(p) onto H (p), proving that G(p) ~ Lf (p). Conversely, 
suppose that G(p ) ~ 77(p) for all primes p. Let G — G(p\) ®G(p2) ©■ ■ -®C?(p*:) 
and 77 = 77(pi) ® 77(P 2 ) ® ■ • * ® H{pk)- Then G(pi) ^ H(Pi ) for all i. Let 
fi : G{jpi ) —► 77(pi) be an isomorphism of G(pi) onto 77(pi). Define f : G H 

by /(P 1 +P 2 +* ■ -+^jfc) = fi{9i)+ f2{92) + - fAW- Then / is an isomorphism 

of G onto 77. Hence, (7 ~ 77. 

9.1.2 Exercises 

1 . Let G be an Abelian group of order pq , where p and q are distinct primes. 
Show that G — Z p ® Z g . 

2. Find all Abelian groups of orders 9, 16, 27, and 32. 

3. Find all Abelian groups of orders 15 and 21. 

4. Find all Abelian groups of orders 60, 80, 240, and 540. 

5. Prove that if G is an Abelian group of order 3-7-11, then G is cyclic. 

6 . Find the elementary divisors of the following groups. 

(i) Z 12 ® Z 144 ® Zg. 

(ii) Z 10 ® Z 30 ® Z 120 - 

7. Let A, B, and C be finite Abelian groups such that A ® B ~ A ® C. 
Prove that B ~ C. 

8 . Let G be an Abelian group such that G = G 1 © G 2 , where G\ and G 2 
are subgroups of G. Suppose that G = H\ ® 772 , where Hi is a subgroup 
of Gi, i = 1,2. Prove that H t = Gi , i = 1,2. 

9. Determine all Abelian groups of order p 4 , where p is a prime. 

10. Find all Abelian groups of order p 3 g 2 , where p and q are distinct primes. 

11. Find all Abelian groups of order 72 which contain exactly three subgroups 
of order 2 . 
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12. Prove that an Abelian group of order 8 is cyclic if and only if it has only 
one subgroup of order 2 . 

13. Prove that a finite Abelian group is cyclic if and only if all of its Sylow 
subgroups are cyclic. 

14. Prove that a finite Abelian group of order n is cyclic if n is not divisible 
by p 2 for any prime p. 

15. Find the number of elements of order 3 in a finite Abelian group of order 

120 . 

16. Show that every Abelian group of order 28 has an element of order 14. 

17. Find all Abelian groups of order 81 that have an element of order 27. 

18. Which of the following statements are true? Justify your answer. 

(i) There is only one (up to isomorphism) Abelian group of order 35. 

(ii) The groups Z 5 © Z 3 © Z 5 © Z 3 and Z 5 © Z 5 © Z 9 are isomorphic. 

(iii) The number of nonisomorphic Abelian groups of order 3 4 is the same 
as the number of nonisomorphic Abelian groups of order 7. 

9.2 Finitely Generated Abelian Groups 

A finite direct sum of cyclic groups need not be a cyclic group. For example, 
Z 2 0 7iq © Z is not a cyclic group. This group has elements of finite as well as 
of infinite orders. However, it is an Abelian group. Now 

([1], [0], o), ([0], [1], o), «0], [0], 1 ) € z 2 e Z 6 ® z 

and any element of this group can be expressed as 

m([l], [ 0 ], 0 ) + n 2 ([ 0 ], ( 1 ], 0 ) + n 3 ([ 0 ], [ 0 ], 1 ) 

for some integers n\, r?, 2 , n$. A group of this kind is called a finitely generated 
Abelian group and is the subject of this section. Since a finite Abelian group 
has only finitely many elements, a finite Abelian group is obviously a finitely 
generated Abelian group. The main objective of this section is to give a com¬ 
plete description (up to isomorphism) of all possible types of finitely generated 
Abelian groups. 

Definition 9.2.1 A group G is called finitely generated if there exists a 
finite nonempty set X C G such that G = (X) . In this case, we call X a 

generating set for G. 
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Let G be a finitely generated Abelian group generated by X, where X ~ 
{&!, 02, ..., Gfc}. Then G = {nioi + 72202 H - + n^a k | ni E Z, 1 < i < k}. 

Definition 9.2.2 Let G be an Abelian group. Let X = {01,02,..., o^} be a 
finite nonempty subset of G. X is called a basis for G if G — (X) and for all 

ni 6 Z, 1 < i < k, niOi + 722*22 +-f- — 0 implies that n{ = 0 , l < i < k 

(i.e., X is linearly independent). 

An Abelian group G is called a finitely generated free Abelian group 
if G has a finite basis. 

Theorem 9.2.3 Let G be an Abelian group. Then the following conditions are 
equivalent. 

(i) G has a finite basis. 

(ii) G is the finite (internal) direct sum of a family of infinite cyclic sub¬ 
groups. 

(in) G is isom,orphic to a finite direct sum, of finite copies ofZ. 


Proof. (i)=>(ii): Let X = { 01 , 02 ,... , 0 *,} be a basis of G. Let nai = 0 for 

some n 6 Z. Then Oai + ■ • • + nm +-b Oa^ = 0. Hence, n = 0. This implies 

that ai is of infinite order and (af) is an infinite cyclic group, 1 < i < k. It is 

easy to verify that G — (ai) © ■ • • ® (afi) . 

(ii) =^>(iii): Let G = G\ 0 • • • © Gk, where Gi is an infinite cyclic subgroup 

of G, 1 < i < k. Then Gi — Z, 1 < i < k. Hence, G ~ Z © • • • © Z. 

(iii) =>(i): Suppose G~Z®---©Zisa finite direct sum of k copies of 
Z. Let Z( fc ) denote Z © - • • © Z and / : G —» Z ^ be an isomorphism. Let 
m = (0,..., 0,1,0,..., 0) G Z^ k \ with the ith component 1,1 < i < k. Then 
since / is onto Z^ k \ there exists ai E G such that /(afi) = Ui, 1 < i < k. Now 
it is easy to verify that X — {ai, 02 ,...,a^} is a basis of G. I 


From the above theorem, it follows that in a finitely generated free Abelian 
group every nonzero element is of infinite order and that a finite Abelian group, 
though finitely generated, cannot be a finitely generated free Abelian group. 
Also, from the above theorem, we can draw an interesting conclusion that for 
every positive integer n, there exists a finitely generated free Abelian group 
with a basis consisting of n elements. 

Consider the finitely generated free Abelian group Z©Z. Now {(1, 0), (0,1)} 
and {(—1, 0), (0, —1)} are two different bases of Z®Z. Thus, a finitely generated 
free Abelian group may have more than one basis. However, the number of 
elements in each basis is the same as proved in the next theorem. 


Theorem 9.2.4 Let F be a finitely generated free Abelian group. Then any 
two bases of F have the same number of elements. 
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Proof. Let X — {ai, a- 2 ,..., a^} and Y = {b\, 62 ,..., b r } be two bases of 
F. Then F~Z©-*-©Zisa finite direct sum of k copies of Z. Now 2 F is a 
subgroup of F and 2 F ~ 2Z © • ■ ■ © 2Z. Hence, 


F 
2 F 


Z Z 

_ ON 

- /-v VL/ -v r—r M© 

2Z 2Z 


Z 

0 2Z ‘ 


k summands 


This implies that |F/2F| = 2 k . Similarly, since Y is a basis of F, \F/2F\ = 2 r . 
Thus, 2 k = 2 r and so k = r. ■ 


Let F be a finitely generated free Abelian group. The number of elements 
in a basis of F, which is unique by Theorem 9.2.4, is called the rank of F. 

Theorem 9.2.5 Every finitely generated Abelian group is a hom,om,orphic im¬ 
age of a finitely generated free Abelian group. 


Proof. Let G be a finitely generated Abelian group generated by X — 
{ai, < 22 , • • •, a/c}- Let F be a finitely generated free Abelian group of rank k and 
let {rr 1 , £ 2 ,..., £fc} be a basis for F. Define 

f : F G 


by 

f ijX l£l © n 22?2 © " ‘ ’ © 'R'k'^k) ri^CL^ © 77-202 © ' " " © ^2A©A; 

for all rii £ Z, 1 < i < k. Let n t , m n (E Z, 1 < i < k be such that ni£i + n2X2 + 

• • - + nfc£fc = mi£i + 777-2£2 + • • • + m.^Xk- Then ( n\ — 777-1 )mi + (772 — 777 - 2)£2 + 

• • • + (n* — m,fc}Xk = 0. Hence, — m,i = 0 for all 1 < i < k and so = m-i for 
all 1 < i < k. Thus, f is well defined. Also, / is an epimorphism and hence G 
is a homomorphic image of F. ■ 

Lemma 9 . 2.6 Let F = (x) , x € F, be a free Abelian group. Then for all 
m E Z, 771 > 0 , 

F/ (mx) ~ Z m . 


Proof. Define 

/ : F —* Z 


by 


f(nx) = [n 


for all n G Z, Let 77i£ — n 2 X. Then ( 77.1 — 7?-2)£ — 0 and so 77-1 = 772 . Hence, 
[77-1] = [772]. Therefore, / is well defined. It is easy to verify that / is an epi¬ 
morphism. Now 77£ € Ker / if and only if f(nx) = [0] if and only if [ 77 ] = [0] if 
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and only if m\n if and only if n — ms for some s E Z if and only if nx = msz 
for some s E Z if and only if nx E (mx). This implies that Ker / = (mi) . 
Thus, F/ (mx) ~ Z m . ■ 

The proof of the following lemma is straightforward and we leave it as ar 
exercise. 

Lemma 9.2.7 Let F be a free Abelian group of rank k. Let {xi, x 2y ..., Xk} be 
a basis of F and n E Z. Then for all i ± j, 1 < i,j < k, {xi,x 2 , ■. • 

Xj + nxi, ,..., x k } is also a basis of F. ■ 

Consider the group Z. Now Z is a free Abelian group of rank 1 and {1} is a 
basis of Z. Every nonzero subgroup of Z is finitely generated and is generated 
by n for some positive integer n. Hence, every nonzero subgroup of Z is also 
free. We extend this result to any finitely generated free Abelian group in the 
next theorem. 

Theorem 9.2.8 Let F be a free Abelian group of rank k and H be a nonzero 
subgroup of F. Then there exists a basis {xi, x 2 ,..., x k } of F, an integer r 
(1 < r < k), and positive integers m 2 ,..., m r such that 2 < i < r 

such that m 2 x 2 ,..., m r x r } is a basis of H. 

Proof. The proof is by induction on k. If k = 1, then F = (x Y ) and since a 
subgroup of a cyclic group is cyclic, H is cyclic. Clearly H = (m^i) for some 
mi > 0. Suppose now that the theorem is true for all free Abelian groups of 
rank < k. Let 

S = {m E Z | m > 0 and there exists a basis {yi ,..., y k } of F 

such that myi + n 2 y 2 H-+ n k y k e H, for some n 2 ,..., n k £ Z}. 

Since H ^ {0}, 5^0. Thus, S contains a smallest positive integer, say, m\. 
This implies that there exists a basis {y\,y 2 , ... ,y k ] of F such that m\y\ + 

^ 2 V 2 ~\ -h n k y k 6 H for some n 2 ,..., n k G Z. Also, for any basis {z u z 2 ,..., z k } 

if F, if siZi + s 2 z 2 +-b s k z k E H for some si, ..., s k E Z, si > 0, then 

m\ < s\. Let h = m\y\ +n 2 y 2 -\ - '-n k y k E H. Now by the division algorithm, 

there exist qi , E Z such that 

rii = q l m l + r i} 0 < r { < m u i = 2 ,3,..., k. 

From this, it follows that h ~ Tn\(yi J rq 2 y 2 + - ■ ■ +q k y k ) + r 2 y 2 + • * - + r k y k . Since 

[2/1 + 923/2-1- fQkVk, 3/2, • • •, Vk} is a basis of T, we find that u = 0, 2 < i < k, 

3y the choice of mi. Hence, mixi = h E H, where x\ — yi + q 2 y 2 -f • —j- q k y k . 
^et K = (y 2 ,... t y k ) . Then AT is a free Abelian group of rank k - l and 
C = (xi) © K. We now claim that H = (m 1 x 1 ) © (H n K). Let a E H. TJien 
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a = tiXi+t2V2F- ■ -+tkyk f°r some t{ E Z, 1 < i < k. By the division algorithm, 
there exist qi,rj E Z such that t\ = q\m\ + r\, 0 < r\ < m\. This implies that 
r\X\ + t2V2 + • • - + tkVk = a — qimiXi E H and so r\ = 0 by the minimality of 
m\. Thus, t2U2 + • ■ • + tkDk € H. Therefore, 

a = q\{mixi) + t 2 y 2 d-h t k yk G (mixi) + (H D F). 

It now follows from F = (xi) 0 K that 

tf = (mixi) ©(iff! K). 

If H fl K = {0}, then H — {m\X\) and the theorem is true. Suppose that 
H n K 7 ^ {0}. Then H n K is a nonzero subgroup of the finitely gener¬ 
ated free Abelian group K. Hence, by the induction hypothesis, there ex¬ 
ists a basis {x 2 ,...,Xk] of K and positive integers r, m, 2 ,... ,mk such that 
{ 777 . 2 x 2 , m r x r } is a basis of H fl K and 7n^_i|m^, 3 < i < r. Clearly 
{xi,...,Xfc} is a basis of F and { 7771 X 1 , 7772 X 2 , m r x r } is a basis of H. 
It only remains to be shown that 7771 ) 777 . 2 . By the division algorithm, there exist 
g, r E Z such that m 2 = qm\ + r, 0 < r < 777.1 - Now {x 2 , Xi + gx 2 , X 3 ,..., x^} is 
a basis of F and rx 2 d- 7 ni(xi +< 7 x 2 ) = 777 . 1 X 1 -(- 7772 X 2 + 07773 X 3 + - • --hOmkXk E H. 
Thus, by the minimality of 77 ii, r = 0, proving that 777 i|t 77 . 2 . ■. 

The next theorem is called the fundamental theorem of finitely gen¬ 
erated Abelian groups. 

Theorem 9.2.9 Let G be a finitely generated nonzero Abelian group. Then G 
is isomorphic to a finite direct sum of cyclic groups, where the finite summands 
(if any) are of orders mi, m 2 , ■. ., m r , m\ > 1, and mi divides 777 ^+ 1 , 1 < i < 
r — 1. 


Proof. Let G be generated by k elements. By Theorem 9.2.5, G is a 
homomorphic image of a free Abelian group F of rank k. Let / : F —> G be a 
homomorphism of F onto G. Then F/Ker / ~ G. If Ker / = {0}, then 

G ~ F ~Z © • • • © Z . 

'-v-' 

k copies 

Suppose now that Ker f { 0 }. By Theorem 9 . 2 . 8 , there exists a basis {xi, X2, 
... ,xjt} of F, an integer r (1 < r < k), and positive integers 777.1,7772,... ,m T 
such that 777.^_ 1 1 rrii , 2 < i < r, and {7771X1,..., 777 r x r } is a basis of Ker /. Now 
F = (xi) © ■ • • © (xk) and Ker / = {7771X1) © • ■ • © (m T x r ) . Hence, 

G ~ F/Ker/ ~ « ■ • ■ © ® (x r+1 ) © • ■ • © (x k ) 

^ z mi © • ■ • ®'Z mr © Z © - ■ • © Z. ■ 

Recall that a group G is torsion free if and only if every nonidentity element 
of G is of infinite order. 


“Pulix*. 7feoiAe»tatL<m/ “Ph.lySlcS. 
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Theorem 9.2.10 A finitely generated Abelian group G / {0} is torsion free 
if and only if G is a finitely generated free Abelian group. 

Proof. Suppose that G is a finitely generated free Abelian group. Then 
there exists a positive integer r such that 

G ~Z © Z © ■ • • ® Z . 

'-V-' 

t copies 

Now every nonidentity element of Z © Z © • • • © Z is of infinite order. Hence, G 
is torsion free. Conversely, suppose that G is a finitely generated torsion free 
Abelian group. Then by Theorem 9.2.9, 

G ~ Z mi © ■ ■ ■ © Z mr e Z 0 - - - 8 Z 

s copies 

for some positive integers mi, m 2 ,..., m T and a nonnegative integer s. If r 7 ^ 0, 

then Z mi ©• • -0 Z mr 0 Z 0 • • • 0 Z and so G contains a nonzero element of finite 

v ---' 

s copies 

order, which contradicts the hypothesis. Hence, r = 0. Thus, G —Z 0 • • ■ 0 Z, 

'' -V ' 

s copies 

proving that G is a finitely generated free Abelian group. ■ 

Theorem 9.2.11 Let G be an Abelian group. Let 

T(G ) = {a £ G | o (a) is finite}. 

Then T(G) is a subgroup of G. Suppose G is finitely generated. If G/T(G) 7 ^ 
{0}, then G/T(G) is a finitely generated free Abelian group. 

Proof. Clearly T(G) is a subgroup of G. It is also a simple exercise to show 
that G/T{G) is finitely generated. Suppose G/T(G ) 7 ^ {0}. Let a + T(G ) £ 
G/T(G). Now n{a + T(G)) = 0 + T(G) if and only if na £ T(G) if and only 
if m{na ) = 0 for some positive integer m if and only if a £ T(G) if and only 
if a + T(G) — 0 + T(G). Hence, G/T(G ) is torsion free. By Theorem 9.2.10, 
G/T(G) is a finitely generated free Abelian group. ■ 

Definition 9.2.12 Let G be an Abelian group. The subgroup T(G) in Theorem, 
9.2.11 is called the torsion subgroup of G. 

£ 

Theorem 9.2.13 Let G be a finitely generated nonzero Abelian group. Let 

G — Z mi 0 • • * 0 Z mfc 0 Z 0 • ■ ■ 0 Z 

S " 1 " S ^ 11 1 y 

r copies 

IPilHjc. 7^a^Lc#fta£uiaZ 
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and 

G — Z ni © ■ • ■ © Z n © Z © • • ■ © Z, 

s copies 

where m,\ > 1 , m,i divides 770 +i, 1 < i < k — 1 , n\ > 1 , and divides n{ + i, 

1 < i < g — 1 . Then k = q, r = s, and m,i — ni, 1 < i < k. 

Proof. Let 

G\ = Z mi © • • • © Z mfc © Z 0 - - . 0 Z 

r copies 

and 

C?2 — Z ni © • ■ • © Z n<7 © z © • • • © z . 

V V / 

s copies 

We first show that the torsion subgroup T{G\) is isomorphic to Z mi © • • • ©Z mfc 
and the torsion subgroup T((? 2 ) °f G2 is isomorphic to Z ni © • • • © Z Uq . Let 
a G G1. There exists Xi G Z mi , i = 1 , 2,..., k and yj G Z> j = 1 , 2,..., r, such 
that a can be written uniquely as 

A ( 3 ? 1 5 27 2 5 • • • j 27 fc) 1/1; 1/2 j • • ■ j1/r) • 

Let m! G N. Then 777/a = (tti'xi, m'x2 ,..., m'x k , rn'y \, 777/7/2, • • ■ , rn'y r ). Since 
yj G Z, we find that m'yj = 0 if and only if yj — 0 . Again for m = 
mim2 ■ • • mxi = 0 , i = 1 , 2 ,..., k. Thus, it follows that o(a) is finite if 
and only if y\ = 1/2 = • • • = y r = 0 and so T(Gi) is the set of all elements 
a = (xi, X2,. ■., Xk, 0 , 0 ,..., 0 ) G G\. Consequently, T{G\) ~ Z mi © • • • © Z mk . 
Similarly, T(G 2 ) — Z ni © ■ • ■ © Z ng . 

Next, let us show that k — q and = 70, i = 1 , 2 ,..., k. Since for groups 
A and B , A © B ~ B © A, we find that Z mi © • • • © Z mfc ~ Z mk © • • ■ © Z mi . For 
convenience, let us write t\ = 777..., t k = m-i, and set G3 = Z tl © • • • © Z tk , 
where L are positive integers, t k > 1 and L + i\L, i = 1 , 2 ,..., k — 1 . Similarly, 
Z ni ©• • • ©Z„ g ~ Z n? ©• • • ©Z ni . For convenience, let us write r\ = n q ,... ,r q — 
ni and set G4 = Z ri © • • • © Z r? . For 1 G G3 there exist 20 G Zt- such that 
x = (xi,..., xjt). Now |Z ti | = L and ^|li, i — 1 , 2 ,...., A:. Also, note that 
Lxi = 0 , i = 1 , 2 ,.., ,fc. Hence, t\a = 0 for all a G G3. Again in Zq, there 
exists an element x\ of order ti, which implies that a — (sq, 0,..., 0) is an 
element of order t\ in G 3. Similarly, we can show that G4 contains an element 
b such that 0(6) = 7*1 and r\y = 0 for all y G G4. Since G3 ~ G4, there exists an 
isomorphism, say, / : G3 —> G4. Now o(/(a)) = t,\ and also r\f(a) = 0 . Thus, 
t\ < r 1. A similar argument shows that rq < t\ and so rq = t\. Suppose now 
that r 2 = i 2 >--- Ti -1 = L-i, hut 7q ^ L, where 1 < i < min (k,q). Let L < Ti 
and let K — {Lx \ x G G3}. It can be shown that K is a subgroup of G3 and 
if Z ti = {0), i = 1 , 2,..., k, then 

K = (Lai) © { La 2 ) © • • ■ © (La k ) 
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and hence 
\K\ = 


>( g l) 


J (Q 2 ) 


K°fc) 


gcd(o(ai),ti) gcd(o (a 2 ),ti) gcd(o (a k )M) 

_£1_£2_... £& 

gcd(£i,q) gcd(£ 2 ,q) gcd (t k ,ti) 

Vt'" VrVfV "'L ^ nCe = 1, 2, . . . , 

— 1 

^ ^j 


-i) 


(9.1) 


Now f(K) = {t;/(x) | x S G 3 }. If Z rj = { bj), j = l,2,...,q, then 


f(K) = (Ubx) © (tj 6 2 ) © • • • ® (Ubg). 


Hence, 


\f(K)\ 


°(M _ o(b 2 ) _ _ . o(b g ) 

gcd(o(6 1 ),^) gcd(o(6 2 ),ti) ’ ' ‘ gcd(o(b q ),ti) 

t 1 _ r 2 _ _ _ r q 

gcd(ri y ti) gcd (r 2 ,ti) ‘ ' ' gcd (r q ,ti) 

1 1 . . . Nzl _ Tj _ t . Tq 

gcd(ti,tj) gcd(ii_i,iti) gcd(ri,tj) gcd(r 7 ,^) 

£l£2 . . . h=l r i _ r <i 

ti U U gcd(T- i ,t i ) gcd {r q ,ti)‘ 


Since \K\ = \ f{K )\, it follows from Eqs. (9.1) and (9.2) that 


gcd (r*,t z ) 


gcd(r g , tf) 


= 1 . 


(9.2) 


(9.3) 


Since L < r it gcd(r 2 ,^) < r { and hence gcd g. u) > 1 . Thus, we find that the 
left-hand side of Eq. (9.3) is greater than 1, whereas the right-hand side of 
Eq. (9.3) is 1. This is a contradiction. This contradiction implies that L ft ri. 
Similarly, n ft t im Hence, L = r^. But G 3 ~ G\ implies that |(? 3 | = |C 4 | and so 
tp 2 • • • tk = r\T 2 • • • r q . Note that L and r % are positive integers greater than 1 . 
If k < q, then L = ri, i = 1 , 2,..., k and hence 1 = r^ + i ■ • - r q , which is not true. 
So A; ft q. Similarly, q ft k. Consequently, k = q and ti — r^, i = 1 ,2,..., k. 

Finally, let us show that r — s. From the assumption and from the above 
proof, it follows that 

G~H®F~H(&F', 

where H is a finite direct sum of finite cyclic groups and F and F' are finitely 
generated free Abelian groups of rank r and s, respectively. The restriction 
of the isomorphism G ~ H ® F maps T{G) onto H. Hence, G/T(G) ~ F, 
which shows that G/T{G) is a finitely generated free Abelian group of rank 
r. Similarly, GjT(G) ~ F' implies that G/T{G) is a finitely generated free 
Abelian group of rank s. Thus, r = s. ■ 


Corollary 9.2.14 Let G be a nonzero finite Abelian group. Then there exists 
a unique list of positive integers (not necessarily distinct) 77 ?q, m 2 ,..., such 
that mi > 1, m;|m i+ i, i = 1,2,... ,k - Land G % Z mi © • ■ • © Z^. . 

ru+ixL. 



9.2. FINITELY GENERATED ABELIAN GROUPS 


266 


Theorems 9.2.9 and 9.2.13 give a complete system of invariants for finitely 
generated Abelian groups. That is, the number r of Theorem 9.2.13 together 
with the integers mi,m 2 ,. -., m* are invariants for finitely generated Abelian 
groups in the sense that any two finitely generated Abelian groups with these 
numbers must be isomorphic. 

Let G be a finitely generated Abelian group. Then the unique number r of 
Theorem 9.2.13 is called thejbetti number of G and the integers mi, m 2 ,..., 
m*., which are uniquely determined for the group G, are called the torsion 
coefficients of G. 


9.2.1 Worked-Out Exercises 

0 Exercise 1 Show that (Q,+) is not finitely generated. 


Solution: Suppose (Q, +) is finitely generated. 


set 


CL\ &2 O^tl 1 


Then there exists a finite 


of rational numbers such that Q = Now we can find a prime 

p such that p does not divide 61 , 62 , • - •, b n . Let x € Q. There exist integers 
r i > r 2 , ■ • ■, r n such that 


a 1 a2 

x — r \-—b 7 - 27 —I-+ r 

01 02 


71 



C 

b\b 2 ■■■b n 


for some integer c. Since p does not divide 61 , b 2 , ..., 6 n , we find that p does 
not divide 6162 • * • b n . Hence, p does not divide the denominator of any rational 
number (expressed in lowest terms) of This implies that ^ ^ 

a contradiction. Thus, (Q, +) is not finitely generated. 


0 Exercise 2 Let G be a nonzero finitely generated Abelian group such that 
every nonzero element of G is of order p, where p is a prime. Show that 
|(j| = p k for some positive integer p. 


Solution: By Theorem 9 . 2 . 9 , G ~ Z mi © 0 Z mfe © Z © — • © Z . If 

r copies 

0, then G contains elements of infinite order. Hence, r — 0 and so G cx 
Z mx © • • • © Zm k ■ Since each nonzero element of G is of order p, we find tha - 
mi = ■ • ■ = mfc - p. Thus, G ~ Z p © • • • © Z p and so |G?| = p k . 


<0> Exercise 3 Show that the torsion subgroup, T{G), of G = Z 4 ©Z©Z 5 ©Z ; : 
is a cyclic group. Find |T(G)| . 

“PuAtx*. 7feoiAe»tatL<ra/ 
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Solution: Recall that Z mn ~ Z m © Z n if and only if gcd(m,n) = 1. Now 

G = Z4 © Z © Z5 © Z3 

^ z 4 © Z 5 © Z 3 © Z 
— Z20 © Z3 © z 
— Zgo © Z. 

Hence, T(G) ~ Zgo and so T(G ) is a cyclic group. Also, |T(G , )| = |Zgo| = 60 . 

0 Exercise 4 Show that there are integers 4 ,...,4 such that 4 > 1, 4|4 +1 , 
^ == 1, and Z 2 2 © Z 3 4 © Z 3 0 Z52 © Z2 — Z dl © Z^ 2 © • • • © Z^ fc . 

Solution: Let G = Z 2 2 © Z 3 4 © Z 3 © Z 5 2 © Z 2 . Then 

G ~ Z2 © Z22 © Z3 © Z 3 4 © Z52. 

Thus, the elementary divisors of G are 2 , 2 2 , 3 , 3 4 , and 5 2 . We form the following 

table: 

2 3 

2 2 3 4 5 2 

From this table, we arrange the summands in the following way: 

G — (Z2 © Z 3 ) © (Z 2 2 © Z 3 4 © Z52) 

— Z 6 © Zgioo- 

Hence, 4 = 6 and 4 = 8100 . 

9.2.2 Exercises 

1 . Show that the group Z © Ziq is finitely generated, but has no basis. 

2. Let G be a finitely generated nonzero Abelian group in which every non¬ 
identity element is of order 2. Show that |G| =2^ for some positive integer 
k. 

3 . Show that the torsion subgroup, T(G), of G = Z 4 © Z 7 © Z © Z © Z 9 is 
a cyclic group. Find \T(G) \. 

4 . Find the torsion coefficients and the betti number of the group Z20 © Z © 
Z © Z15 © Zg. 

5 . Find the elementary divisors of the group G = Z 2 2©Zi 5 ©Z 48 and find the 
positive integers di,d 2 ,. ..,4 such that dx > 1 , 4 | 4 +1 ,« = 1, 2 ,..., k-l, 
and G ~ Z c f 1 © Z d2 © • • • © Z dk . 

6. Find all Abelian groups of order 540 . Express them as a direct sum of 
Abelian groups of the form Z dl © Z^ 2 © • ■ • © Z dk such that 4 , d 2 , ..., 4 
are positive integers and 4 > 1, 4|4+i, f = 1, 2,..., k - 1. 

TPuJuc- 
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7 . Are the following pairs of groups isomorphic? 

(i) Z20 © Z75 © Z90 and Z120 © Z25 © Z45. 

(ii) Z15 © Z12 © Z30 © Z © Z and Z108 © Z50 © Z © Z. 

8. Show that the group Z200 © Z30 © Z36 is isomorphic to Z120 © Zig © Zioo- 

9 . Let G be a finitely generated Abelian group generated by n elements. 
Let H be a subgroup of G. Prove that H is also finitely generated and H 
may be generated by m elements, where m < n. 

10 . Let H be a subgroup of an Abelian group G. If H is finitely generated 
and G/H is finitely generated, prove that G is finitely generated. 

11. Prove that every homomorphic image of a finitely generated Abelian 
group is finitely generated. 

12. Prove that two finitely generated free Abelian groups are isomorphic if 
and only if they have the same rank. 

13 . Prove or disprove: 

(i) In a finitely generated free Abelian group G of rank n, any linearly 
independent subset of n elements is a basis of G. 

(ii) In a finitely generated free Abelian group G of rank n, any linearly 
independent subset of m elements, m < n, can be extended to a basis of 
G. 

(iii) Every finite Abelian group is a finitely generated free Abelian group. 





Chapter 10 

Introduction to Rings 


In the previous chapters, we investigated mathematical systems with one binary 
operation. There are many mathematical systems, called rings, with two binary 
operations. The notion of a ring is an outgrowth of such mathematical systems 
as the integers, rational numbers, real numbers, and complex numbers. 

Although David Hilbert coined the term “ring,” it was E. Noether who, 
under the influence of Hilbert, set down the axioms for rings. In 1914, Fraenkel 
gave the first definition of a ring. However, it is no longer commonly used. 

As we shall see, a ring is a particular combination of a group and a semi¬ 
group. Hence, our previous work will prove helpful in our examination of rings. 
However, it is not enough to examine a set with two independent binary oper¬ 
ations. In order to obtain the full power of the axiomatic approach, we need a 
dependency between the two operations—in particular, the distributive laws. 


10.1 Elementary Properties 

This section parallels Chapter 2. First we give a definition of a ring, followed 
by examples and elementary properties. We introduce several notations and 
definitions which will be used throughout the text. 

The two binary operations that we consider on a nonempty set are usually 
denoted by -I- (addition) and • (multiplication). 

A ring is a mathematical system (R, +, ■) such that (R, +) is a commutative 
group, (R, •) is a semigroup, and the distributive laws hold, i.e., for all a, b. 

ce R, 

a ■ (b -r c) = (a • b) + (a • c), 

(b + c) ■ a = (b ■ a) + (c • a). 

We denote the identity of (R, +) by the symbol 0. The additive inverse of 
an element a E R is denoted by — a. 

We now give a complete definition of a ring. 
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Definition 10.1.1 A ring is an ordered triple (R, +, •) such that R is a nonem¬ 
pty set and 4 - and • are two binary operations on R satisfying the following 
axioms. 

(R1) (a + b) + c = a + (6 + c) for all a,b,c E R. 

(R2) a 4- b — b -f a for all a, b E R. 

(R3) There exists an element 0 in R such that a + 0 = a for all a E R. 

(R4) For all a E R, there exists an element —a 6 R such that 

a 4- (—a) = 0. 

(R5) (a • b) • c = a • (b ■ c) for all a, b, c E R. 

(R 6 ) a ■ (b + c) = (a • 6) -f- (a • c) for all a,b,c E R. 

(R7) (b + c) • a — (b • a) + (c • a) for all a, b, c E R. 

We call 0, the zero element of the ring (R, +, •). 

During the development of the theory of rings, we will use the following 
conventions. 

1. Multiplication is assumed to be performed before addition. 

2. We write ab for a • b. 

3. We write a — b for a + (—6). 

4. We refer to a ring ( R , +, •) as a ring R. 

Accordingly, ab + c stands for (a • b) + c, ab + ac stands for {a ■ 6) + (a ■ c), 
ab — ac stands for (a ■ b) + ( —(a ■ c)), where a, 6, c E R. 

Example 10.1.2 Consider Z, the set of integers, together with the usual ad¬ 
dition, +, a,nd multiplication, -. By Example 2.1.3, (Z, +) is a group. Now 
multiplication of two integers is an integer and associativity holds for *. Fi¬ 
nally, we know that the distributive laws hold for the integers. Thus, (Z,+,-) 
is a ring. 

The ring of Example 10.1.2 is called the ring of integers. This ring plays 
an important role in the study of ring theory. One of the basic problems in 
ring theory is to determine rings, which satisfy the same type of properties as 
the ring of integers. 

Definition 10.1.3 A ring R is called commutative ifab — ba for all a,b E R. 

A ring R which is not commutative is called a noncommutative ring. 

“Public. TfeafAewtafLea/ PAy.J.ZcJ. 
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From the above definition, it follows that a ring R is commutative if and only 
if the semigroup (R, •) is commutative. The ring of integers is a commutative 
ring. 

For a ring R, the set C(R ) = {a E R \ ab — ba for all b E R } is called the 
center of R. It follows that R is commutative if and only if R = C(R). 


Example 10.1.4 Let M 2 (Z) denote the set of all 2 x 2 rn.atri.ces over the ring 
of integers. Let + and • denote the usual matrix addition and multiplication, 
respectively. Since addition (multiplication) of 2x2 m, a trices over Z is a 2 x 2 
m.atrix over Z, it follows that + and, • are binary operations on M 2 (Z). It is now 

easy to show that (M 2 (Z),+,-) is a ring. Now 

and 


5 

7 


1 

8 


E M 2 (Z) 


1 2 
3 4 


5 6 
7 8 


19 22 


23 34 ' 


’5 6 ' 


" 1 2 ~ 

43 50 

7^ 

31 46 

— 

7 8 


3 4 


Therefore, M 2 (Z) is not a commutative ring. 


In a ring R, an element e E R is called an identity element if ea = a — ae 
for all a E R. An identity element of a ring R (if it exists) is an identity 
element of the semigroup (i?, ■). Therefore, a ring cannot contain more than 
one identity element (Theorem 1.6.11). The identity element of a ring (if it 
exists) is denoted by 1. 


Definition 10.1.5 A ring R is called a ring with identity if it has an iden¬ 
tity. 


Example 10.1.6 The ring Z of integers is a ring with identity. The integer 1 
is the identity elem.ent of Z. 


Example 10.1.7 The ring M 2 (Z) of Exam,pie 10.1.4 is a ring with identity. 
The identity elem.ent of Af 2 (Z) is 

Example 10.1.8 Let R denote the set of all functions f : R —► R. Define -h 
■ on R by for all f,gER and for all a€R, 


1 0 
0 1 


if Tg)(a) = /(a) + g(a) t 
(/•P)(a) = f{a)9{a). 

From, the definition of + and •, it follows that + and ■ are binary operations 
on R. Let f,g,hE R. Then for all a E R. we have by using the associativity of 
R that ((/ + g) + h)(a) = (f + g)(a) +_^(a^= 
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G?(°)+M a )) — /( a ) + (y+^)( a ) — if F{gFh))(a). Thus, (f+g)+h — f+(g+h). 
This shows that + is associative. In a similar way, we can show that the other 
properties of a ring hold for R by using the fact that they hold for R. Thus, 
(R, +, •) is a ring. We note that the function io : R —> R, where io(a) — 0 for 
all a £ R, is the additive identity of R and the element i\ £ R, where i\{a) — 1 
for all a £ R, is the identity of R. Also, for all f,g £ R and for all a £ R, 
(/■S)0) = f( a )g( a ) = 9 (a)f(a) = (g-f)(a). Thus, for all /, g £ R, f ■ g = g ■ f. 
Consequently, ( R , +, •) is a commutative ring with identity. 

The addition and multiplication on R in Example 10.1.8 are the same as 
those encountered by the student in calculus. 

Example 10.1.9 Let (G, *) be a commutative group and HomfG , G) be the 
set of all hom,omorphism.s of G into itself. Now the composition of two hom.o- 
morphisms of G is again a homom,orphism. of G and so o is a binary operation 
on Hom,(G,G). Also, o is associative by Theorem, 1.5.13 and iQ £ Hom,(G,G ) 
is the identity. Thus, ( Hom,(G,G ), o) is a semigroup with identity. We now 
define a suitable + on Hom(G, G) so that ( Hom,(G, G),+, o) becomes a ring 
with identity. Define + on Hom(G,G ) by for all f,g£ HomfG, G), 

(/ + 9 ) (a) = /(a) * g{a) for all a £ G. 

Let f,g £ Hom,{G,G). From, the definition of +, it, follows that f + g is a 
mapping from G into G. Let a, b £ G. Then 

(f + g)(ab) = f{ab)*g(ab) 

= U(a)*fib))*{g{a)*g(b)) 

= f(°)*g(a)*f{b)*g(b) 

= if + 2 ) 0 ) * (f + g)(b). 

This shows that f + g is a homomorphism from, G into G. We omit the rou¬ 
tine verification that + is associative. The identity of ( Hom{G,G ), +) is the 
hom,om,orphism that maps every element of G onto the identity of G. For any 
f £ Hom,(G,G ), the mapping —f defined by (—/)(a) = /(a) -1 for all a £ G 
is the additive inverse of f. Thus, (Hom,(G, G), +) is a group. We now show 
that the left distributive law holds. For any a £ G and any elements f, g, h £ 
Hom(G, G), [fo(g + h)](a) = f((g + h){a)) = f{g(a)*h(a)) = f(g(a))*f(h(a)) 
= (f°g)(a)*(f°h)(a) = (fog + foh)(a). Hence, fo(g + h) = (/op) + (/o/i). 
The right distributive law holds similarly. Consequently, ( Hom(G , G),+, o) is 
a ring. 


We now prove some elementary properties of rings. 

Theorem 10 . 1.10 Let R be a ring and a,b,c £ R. Then 
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(i) aO = Oa = 0, 

(ii) a(—b ) = (— a)b — — (a6), 

(iii) (—a)(—6) = ab, 

(iv) a(b — c) — ab — ac and (b — c)a = ba — ca. 

Proof. (i) Observe that a0 + a0 = a(0+0) = aO. Thus, (aO + aO) + (—(aO)) = 
aO + (—(aO)) and so aO + (aO + (—(aO))) = 0. Hence, aO + 0 = 0 or aO = 0. 
Similarly, Oa = 0. 

(ii) ab + a(—b) = a(b + (—6)) = aO — 0 = aO = a (—b + b) = a(—b) + ab. 

Since the additive inverse of an element is unique, a(—6) = — ( ab). Similarly, 

—a) b = —(ab). 

(iii) Using (ii), we have (— a)(—b) - — (a(— b)) — —(—ab) = ab. 

(iv) Since b—c = &+(—c), a(b—c) = a(6+(— c)) — ab+a(—c) = a6+(—(ac)) 
by (ii)) = ab — ac. Similarly, (b — c)a = ba — ca. ■ 

Corollary 10.1.11 Let R be a ring with 1. Then R ^ {0} if and only if the 
dements 0 and 1 are distinct. 

Proof. Suppose R ^ {0}. Let a € R be such that a ^ 0. Suppose 1 = 0. 

Then a = al = aO = 0, a contradiction. Thus, 1^0. The converse follows 
since R has at least two distinct elements 0 and 1. ■ 

Convention: From now on, we assume that the identity element 1 (if it 
exists) is different from the zero element of the ring. 

From this convention, it follows that if R is a ring with 1, then R has at 
least two elements. 

Let R be a ring with 1. An element u 6 R is called a unit (or an invertible 
element) if there exists v G R such that uv — 1 — vu. We note the following 
properties of invertible elements. 

Theorem 10.1.12 Let R be a ring with 1 and T be the set of all units of R. 

Then 

(i) T^4>. 

(ii) 0 ^ T, and 

(iii) ab E T for all a,b GT. 

Proof. (i) Since 1 ■ 1 = 1 = 1 • 1 . IgT. Hence, T ^ 4>. 

(ii) Suppose that 0 € T. Then there exists v G R such that 0r» = 1 = i>0. 
However, Ou = 0 and so 0 = 1, which is a contradiction. Thus, 0 ^ T. 

(iii) Let a, b G T. There exist c,d G R such that ac = 1 = ca and bd = 1 = 
db. Now (ab)(dc) - a(bd)c - ale = ac = 1 and (dc)(ab) — d(ca)b = dlb — db — 

1. Hence, (ab)(dc) = 1 = (dc)(ab). Thus, ab is a unit and so ab G T. ■ 
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Definition 10.1.13 (i) A ring R with 1 is called a division ring (skew- 
field) if every nonzero element of R is a unit. 

(a) A commutative division ring R is called a field. 

Note that a ring R is a division ring (or skew-field) if and only if (R\{0}, •) 
is a group. Therefore, if R is a division ring, then for all a G R, a ^ 0, there 
exists a unique element denoted by a~ l G R such that aa~ 1 = 1 = a _ 1 a. We 
call a -1 the multiplicative inverse of a. Similarly, a ring R is a field if and only 
if (i?\{ 0 }, •) is a commutative group. 

Example 10.1.14 (i) The ring Z of integers is not a field. In Z, the only 
invertible elements are 1 and — 1 . 

(ii) From, Exam,pie 2.1.3, (Q,+,-) is a fi.eld, where + and ■ are the usual 

addition and multiplication, respectively. Q is called the field of rational 

numbers. 

(Hi) From, Example 2.1.3, (R, +, •) is a field, where + and ■ are the usual 
addition and multiplication, respectively. R is called, the field of real num¬ 
bers. 

(iv) From Example 2.1.3, (C, +,•) is a fi,eld, where + and ■ are the usual 

addition and multiplication, respectively. C is called the field of complex 

numbers. 


The following example is due to William Rowan Hamilton. Due to physical 
considerations, Hamilton constructed a consistent algebra in which the com¬ 
mutative law of multiplication fails to hold. At the time, such a construction 
seemed inconceivable. His work and H.G. Grossman’s work on hypercomplex 
number systems began the liberation of algebra. Their work encouraged other 
mathematicians to create algebras, which broke with tradition, e.g., algebras 
in which ab = 0 with a 0 , b 0 and algebras with a n = 0 , where a /0 and 
n is a positive integer. 

Example 10.1.15 Let Qr — {(01,02,03,04) | o* G R, i — 1 , 2 , 3 , 4 }. Define 
+ and ■ on Qr as follows: 


(01,02,03,04) + (61,62, & 3 , bfi ) — (ai 61,02 + ^2103 + 63,04 + 64) 


(op 02, 03,04) ■ (61, bo. bz - bfi ) — (01&1 — 0262 ~ O3&3 — 0464, 0162 + 02^1 
+0364 — 0463, 0163 + 0361 + 0462 — 02^4, O1&4 + 02^3 — 0362 + O461). 


From, the definition of + and *, it follows that + and • are binary opera¬ 
tions on Qr. Now + is associative and commutative since addition is asso¬ 
ciative and commutative in R. We also note that (0, 0, 0, 0) G Qr is the ad¬ 
ditive identity and if (01,02,03,04) G Qr, then (—01,-02,-03,-04) G Qr 

“Pufuc. 



10.1. ELEMENTARY PROPERTIES 


275 


and — (ai, a 2 , 03, 04 ) = (—ai, — a 2 , — <2,3, — 04). Hence, (Qr, +) is a commu¬ 
tative group. Similarly, ■ is associative and (1,0,0,0) 6 Qr is the mul¬ 
tiplicative identity. Let (ai,a 2 , <23, ( 24 ) £ Qr be a. nonzero element. Then 
N = a\ + al + a§ + 04 7 ^ 0 and N £ R. 77ms, (ai/N, -a 2 /iV, —a^/N, —a^/N) 
€ Qr. We as/c i/?,e reader to verify that (a\/N, — a 2 /7V, —a^/N, —a^/N) is the 
multiplicative inverse of (ai,a 2 , <23,<24). Thus, Qr 7 's a division ring ■and, is 
called the ring of real quaternions. However, Qr is not commutative since 
(0,1,0,0)(0,0,1,0) = (0,0,0,1) ^ (0,0,0,-1) - (0,0,1,0)(0,1,0,0). There¬ 
fore, Qr is not a field,. 


A nonzero element a in a ring R is called a zero divisor if there exists b £ R 
such that b 7 ^ 0 and either ab = 0 or ba = 0. We do not call 0 a zero divisor. 
An element cannot be a unit and zero divisor at the same time (Worked-Out 
Exercise 1 , page 279). Thus, a field has no zero divisors. 

Definition 10.1.16 Let R be a commutative ring with 1. Then R is called an 
integral domain if R has no zero divisors. 


The ring of integers Z is an integral domain. The ring M 2 (Z) is not an 


example, 


1 

0 


0 

0 


0 

0 


1 

0 


£ M 2 (Z) and 


Also, M 2 (Z) has 

zero 

divisors 

0 1 


1 0 


' 00 ' 

1 

0 

0 


L 0 0 


0 0 


.We 


also note that every field F is an integral domain since every nonzero element 
of F is a unit. 


Example 10.1.17 Z[\/3] = (a-f-6\/3 | a, b £ Z} is an integral, domain, where 
the operations + and ■ are the usual operations of addition and multiplication. 
0 + Oa/3 is the additive identity of Z[\/3] and 1 + Oy/3 is the multiplicative 
identity of Z[\/3]. Suppose \/3 is a unit in Z[\/3]. Then (\/3) 1 — a + 6v / 3 for 
som,e a,b £ Z. If a = 0, then (\/3 ) _1 = by/ 3 or 1 = 36, which is a. contradiction 
since this equation has no solution in Z. Therefore, a/0 and so 1 = av / 3 + 36 
or V3=^eQ, a contradiction. Hence, y/3 is not a unit, proving that 
Z[\/3] is not afield. 

By arguments similar to the ones used in Example 10.1.17, we can show 
that the following sets are integral domains under the usual addition and mul¬ 
tiplication. 


Ayft = 

{a + by/n | 

a,b £ Zj 

Z [iy/n\ = 

{a + biyfn 

| a, b £ Z} 

Z[i] - 

{a + bi | a, 

6 6 Z} 

Q[yM = 

{a + by/n | 

a ,6 G Q} 

q[i^/n\ = 

{a + biy/n 

| a, 6 £ Q} 

QW = 

<a + A»j 
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where n is a fixed positive integer and i 2 = — 1. In fact, it can be shown that 
Q[v/n], Qand Q[i] are fields. 

Example 10.1.18 The ring of even integers E is a commutative ring, without 
identity, and without zero divisors. 

The ring appearing in the following example is sometimes useful in the 
construction of counterexamples. 

Example 10.1.19 Let (R, +) be a comm.uta.tive group. Define multiplication 
on R by ab = 0 for all a,b € R, where 0 denotes the identity element, of the 
group (i?, +). Then (R, +, •) is a ring called the zero ring. If R contains m,ore 
than one element, then R is a comm.uta.tive ring without 1 and every nonzero 
element of R is a zero divisor. 

The following theorem establishes a relation between zero divisors and the 
cancellation property of a ring. 

Theorem 10.1.20 Let R be a ring. If R has no zero divisors, then the can¬ 
cellation laws hold, i.e., for all a, b, c E R, a 0, ab = ac implies b = c (left 
cancellation law) and ba — ca implies b = c (right cancellation law). If 
either cancellation la,w holds, then R has no zero divisors. 


Proof. Suppose R has no zero divisors. Let a, b, c 6 R be such that 
ab = ac and a / 0 . Then ab — ac — 0 or a(b — c ). — 0 . Since R has no zero 
divisors and a / 0 , we have 6 — c= 0 or 6 = c. Hence, the left cancellation law 
holds. Similarly, the right cancellation law holds. Conversely, suppose one of 
the cancellation laws hold, say, the left, i.e., if a, 6, c G R, a ^ 0, then ab = ac 
implies b = c. Let a be a nonzero element of R and b E R. Suppose ab = 0. 
Then ab — a 0 , from which 6 = 0 by canceling a. Suppose ba = 0 and 6 7^ 0 . 
Then ba = 60 and by canceling b, we obtain a = 0 , a contradiction. Therefore, 
6 = 0 . Hence, R has no zero divisors. Similarly, the right cancellation law 
implies that R has no zero divisors. ■ 


Definition 10.1.21 A ring R is called a finite ring if R has only a finite 
num,ber of elem.en.ts; otherwise R is called an infinite ring. 

The rings Z and M 2 (Z) are infinite. 


Example 10.1.22 Consider Z n together with the binary operations + n and • n 
as defined in Examples 2.1.5 and 2.1.6. By Example 2.1.5, (Z n , + n ) is a com¬ 
mutative group and by Example 2.1.6, • n is associative and commutative, and 
[1] is the multiplicative identity of (Z n , + n , - n ). Now for all [a], [6], [c] E Z n , 
[a] -n ([fr] Tn [c]) = [a] ■ n [6 + c] = [a (6 + c)] = [a 6 + ac] = [ab] + n [ac] = 
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[a] - n [b] +n [a] ■ n [c]. Similarly, ([6] + n [c]) ■„ [a] = [6] - n [a] + n [c] ■„ [a]. Hence, both 
distributive laws hold. Thus, ( Z n , + n , - n ) is a, commutative ring with 1 , called 
the ring of integers mod n. From, Example 2 . 1 . 6 , not every nonzero element 
of Z n has an inverse. For example, suppose n is not prim,e, say, n = 6 . Then 
[4] has no multiplicative inverse in Z§. Also, Zq has zero divisors. We have 
[ 3 ] fiz [ 0 ] ^ [ 2 ]. Since [3] -6 [2] = [6] = [0], it follows that [3] and [2] are zero 
divisors. Thus, Zq is not an integral dom,ain and thus not a field. We ca,n also 
conclude that [ 2 ] and [3] do not have multiplicative inverses since they are zero 
divisors. 

The above example shows that for every positive integer n, there exists a 
commutative ring R with 1 such that the number of elements in R is n. 

In the following result, we assume that the ring R is commutative. This 
assumption can be removed and the conclusion that R is a field remains valid. 
However, we have not developed the appropriate results to remove this as¬ 
sumption. We will prove the theorem in its most general form in Chapter 
24. 

Theorem 10.1.23 A finite com,mutafive ring R with m,ore than one elem,ent 
and without zero divisors is a field. 


Proof. We must show that R has an identity and that every nonzero element 
of R is a unit. Let ai,a 2 ,...,a n be the distinct elements of R. Let a £ R, 
a / 0. Now aai £ R for all i and so {aai, aa 2 ,..., aa n } C R. If aa t = aa 3 , 
then by Theorem 10.1.20, a* — a 3 . Therefore, the elements aai, &a 2 >-- ■ > aa n 
must be distinct and so R — {aai, • • • ,aa n }. This implies that one of the 
products must be equal to a, say, aai = a. Since R is commutative, we also 
have a*a = aa i — a. Let b be any element of R. Then there exists aj £ R such 
that b = aaj. Thus, 

bai = afi (since R is commutative) 

= afiaaj) (substituting for 6 ) 

— (a^a)aj 
—- aaj 

= b. 

This implies that a^ is the identity of R. We denote the identity of R by 1. Now 
1 £ R — {aai,aa 2 ,... ,aa n ] and so one of the products, say, aaj , must equal 
1. By commutativity, a^a = aaj = 1 . Hence, every nonzero element is a unit. 
Consequently, R is a field. ■ 

The following corollary is immediate from above theorem. 

Corollary 10.1.24 Every finite integral dom,ain is a field. ■ 
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In Example 2.1.6, we showed that a nonzero element [a] of Z n has an inverse 
if and only if gcd(a,n) = 1. Thus, the following corollary is an immediate 
consequence of this fact. We leave the details as an exercise. 

Corollary 10.1.25 Let n be a positive integer. Then Z n is a field if and only 
if n is prime. ■ 

Let R be a ring and a G R. Then for any integer n, define no as follows: 

Oa = 0 

no = a + (n — l)a if n > 0 

na = (—n)(—a) if n < 0. 

We emphasize that no is not a multiplication of elements of R since R may 
not contain Z. We have the following properties holding for any a, b E R and 
any m, n £ Z : 

(m + n)a = ma + no, 
m(o 4-6) — ma 4- mb, 

( mn)a = m(na ), 
m(o6) = (mo)6 = o(m6), 

( ma)(nb ) = mn(ab). 

The proofs of the above properties can be obtained by induction and the 
defining conditions of a ring. 

Definition 10.1.26 If there exists a. positive integer n such that for all a £ R, 
na — 0, then the smallest such positive integer is called the characteristic of R. 
If no such positive integer exists, then R is said to be of characteristic zero. 

Example 10.1.27 The rings Z, Q, R, C ha,ve characteristic 0. The ring Z n 
(n = 1,2,3,...) has characteristic n. Note that in Zq, 3[2] = [6] = [0] and 
2 [3] — [6] = [0]. However, 6 is the smallest positive integer such that 6 [a] = [0] 
for all [a] € Zq. In particular, [1] has additive order 6. 

Example 10.1.28 Let X be a nonempty set and V(X) the power set of X. 
Then (fP(X), A, D) is a comm,utative ring with 1, where A is the opera,tion 
“symmetric difference." In this exam,pie, A acts as + and fl acts as ■. Now for 
all A 6 V{X ), 2 A = ANA — (A\yl) U (7l\yl) = <f>. Thus, V(X) has character¬ 
istic 2. 


Theorem 10.1.29 Let R be a ring with 1. Then R has characteristic n > 0 if 
and only if n is the least positive integer such that nl = 0. 
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Proof. Suppose R has characteristic n > 0. Then na — 0 for all a G R 
and so, in particular, nl = 0. If ml = 0 for 0 < m < n, then ma = m(la) = 
(ml)a = 0a = 0 for all a <E R. However, this contradicts the minimality of 
n. Hence, n is the smallest positive integer such that nl = 0. Conversely, 
suppose n is the smallest positive integer such that nl = 0. Then for all a € R, 
na = n(la) — (nl)a = 0a = 0. By the minimality of n for 1, n must be the 
characteristic of R. ■ 

Theorem 10.1.30 The characteristic of an integral dom,ain R is either zero 
or a prime. 

Proof. If there does not exist a positive integer n such that na — 0 for 
all a G R, then R is of characteristic zero. Suppose there exists a positive 
integer n such that na = 0 for all a G R. Let m be the smallest positive integer 
such that ma = 0 for all a £ R. Then ml = 0. If m is not prime, then there 
exist integers mi, m 2 such that 0 < mi,m 2 < m and m = mim. 2 . Hence, 
0 = (mim 2 )l = (mil)(m 2 l). Since R has no zero divisors, either mil = 0 or 
m 2 l = 0. This contradicts the minimality of m. Thus, m is a prime. ■ 

10.1.1 Worked-Out Exercises 

0 Exercise 1 Let R be a ring. An element a 6 R is called idempotent if 
a? = a and nilpotent if a n = 0 for some positive integer n. 

(i) Let a £ R be a nonzero idempotent. Show that a is not nilpotent. 

(ii) Let R be with 1. Let a G R be such that a has an inverse. Show that 
a cannot be a zero divisor. 

(iii) Let R be with 1 and suppose R has no zero divisors. Show that the 
only idempotents in R are 0 and 1. 

Solution: (i) From the hypothesis, a 2 = a. By induction, a n — a for 
all positive integers n. Suppose a is nilpotent. Then a m = 0 for some positive 
integer m and so a = a m = 0. which is a contradiction and so a is not nilpotent. 

(ii) There exists 6 £ R such that ab — 1 = 6a. Suppose that a is a zero 
divisor. Then there exists c £ R. c ^ 0, such that ac = 0. Thus, 0 = 60 — 
6(ac) = (6a)c = c, which is a contradiction. Hence, a is not a zero divisor. 

(iii) Clearly 0 and 1 are idempotent elements. Let e € R be an idempotent. 
Then e 2 = e and so e(e — 1) = 0. Since R has no zero divisors, either e = 0 or 
e — 1 = 0, i.e., either e = 0 or e = 1. Therefore, the only idempotents of R are 
0 and 1. 

0 Exercise 2 Determine positive integers n such that Z n has no nonzero 
nilpotent elements. 





10.1. ELEMENTARY PROPERTIES 


280 


Solution: We claim that n is a square free integer, i.e., n — P 1 P 2 • * • Pk , 
where the pi s are distinct primes. 

Suppose that n = PiP2‘‘-Pk , Pi’s are distinct primes. Let [a] E Z n be 
nilpotent. Then [a] m = [0] for some integer m. Hence, n divides a m and so 
P\P2 ■ • 'Pk divides a m . Thenp*|a m for alH = 1, 2,..., k. Since thep z ’s are prime, 
Pi | a for alii = 1,2,..., k. Since pi,P2, • ■ • ,Pk are distinct primes, we must have 
P 1 P 2 • • 'Pk\ a i be., n\a and so [< 2 ] = [0]. This implies that Z n has no nonzero 
nilpotent elements. Conversely, suppose that Z n has no nonzero nilpotent 
elements. Let n = p^p ™ 2 - • -p™ fc , where the pi s are distinct primes and 
mi > 1. Let m = max{mi,m 2 ,... ,m k }. Now \p\p 2 • ■ ■Pk} m = [pi m p ™ * ■ -pj 1 ] = 
[0] since nKp^p™ ■ • -p™). Also, since Z n has no nonzero nilpotent elements, 
[P 1 P 2 ■ • ‘Pk] = [0]. Hence, n\(pi ■•■p k ) and so (p^p ™ 2 • • -p™ fc )l(Pi '"Pk)- Thus, 
mi <1 for alH = 1, 2 ,..., k. Hence, = 1 for alH = 1 , 2,..., k and so n is a 
square free integer. 

0 Exercise 3 Show that the number of idempotent elements in Z mn , where 
m > 1, n > 1, and m and n are relatively prime, is at least 4. 

Solution: Clearly, [0] and [ 1 ] are idempotent elements. Since m and n 
are relatively prime, there exist integers a and b such that am + bn = 1. We 
now show that n does not divide a and m does not divide b. Suppose that n\a. 
Then a = nr for some integer r. Thus, n(rm + b) = nrm + nb — am + nb = 1. 
This implies that n = 1, which is a contradiction. Therefore, n does not divide 
a and similarly m does not divide b. Now m?a — m( 1 — nb). This implies 
that [m 2 a] = [m]. Hence, [ma ] 2 = [ma\. If [ma\ = [0], then mn\m,a and so 
n|a, which is a contradiction. Consequently, [ma] 7 ^ [0]. If [ma] = [ 1 ], then 
mn\(ma — 1 ). Hence, ma + m,nt = 1 for some integer t. Thus, m{a + nt) = 1. 
This implies m = 1, which is a contradiction. Hence, [ma] 7 ^ [1]. Thus, [m,a] 
is an idempotent such that [ma] 7 ^ [0] and [ma] 7 ^ [ 1 ]. Similarly, [nb] is an 
idempotent such that [nb] 7 ^ [0] and [nb] 7 ^ [ 1 ]. Clearly [ma] 7 ^ [nb]. Thus, we 
find that [0], [1], [ma], and [nb] are idempotent elements of Z mn . 

0 Exercise 4 Determine the positive integers n such that Z n has no idempo¬ 
tent elements other than [0] and [1]. 


Solution: We show that n = p r for some prime p and some integer r > 0. 

First assume that n = p r for some prime p and some positive integer r and 
[x] € Z n be an idempotent. Then [x] 2 = [x]. Thus, p r \(x 2 — x) or p T \x(x — 1). 
Since x and x — 1 are relatively prime, p r \x or p r \{x — 1 ). If p r \x, then [x] = [0] 
and if p r |(a: — 1), then [x] = [1]. Thus, [0] and [1] are the only two idempotent 
elements. Conversely, suppose that [0] and [ 1 ] are the only two idempotent 
elements. Let n = p^p™ 2 ■ • -p™ fc , where the p ^s are distinct primes, mi > 1, 


and k > 1. Let t = p and s — p■■ -p™ k - Then t and s are relatively 
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prime and n = ts. By Worked-Out Exercise 3 , Z n = Z ts must have at least 
four idempotents, which is a contradiction. Therefore, k — 1. Thus, n = p T for 
some prime p and some positive integer r. 

Exercise 5 Let R be a ring. Show that the following conditions are equivalent. 

(i) R has no nonzero nilpotent elements. 

(ii) For all a E R, if a 2 = 0, then a — 0. 

Solution: (i)=>(ii) Let a E R and a 2 = 0. If a / 0, then a is a nonzero 
nilpotent element of R, a contradiction. Thus, a = 0. 

(ii)=>(i) Let a E R be such that a n = 0 for some positive integer n. Suppose 
a 7 ^ 0. Let n be the smallest positive integer such that a n = 0. Suppose n is 
even, say, n = 2m for some positive integer m. Then (a m ) 2 = a 2m = 0 and 
so a m = 0, contradicting the minimality of n. Suppose n is odd. If n = 1, 
then a = 0, a contradiction. Therefore, n > 1. Suppose n = 2 m + 1. Then 
m + 1 < n. Thus, a 2rn+2 = a 2rn+1 a = a n a — 0. This implies that a m+1 = 0, 
which is a contradiction of the minimality of n. Hence, R has no nonzero 
nilpotent elements. 

0 Exercise 6 An element e of a ring R is called a left (right) identity, if 
ea = a (ae = o) for all a E R. Show that if a ring R has a unique left 
identity e, then e is also the right identity of R and hence the identity of 
R. 


Solution: Let e be the unique left identity of R. Then ex = x for all x E R. 
Let x E R. Now (xe — x + e)x — xex — xx + ex = xx — xx + x = x. This implies 
that xe — x + e is a left identity. Since e is the unique left identity, xe — x + e = e 
and so xe = x. Thus, e is a right identity. 

Exercise 7 Let R be a commutative ring with 1 and a, 6 6 R. Suppose that a 
is invertible and b is nilpotent. Show that a + b is invertible. Also, show 
that if R is not commutative, then the result may not be true. 


Solution: There exists c E R such that ac = 1 = ca and there exists a 

positive integer n such that b n = 0. Let d = c — c 2 6 +c 3 6 2 H-h(—1 ) n+ 1 c n 6 TJ ' -1 . 

Now (a+b)d = ac—ac 2 b+ ac 3 b 2j r ■ ■ • + ( —l) n+ 1 ac n 6 n- 1 + 6 c — 6 c 2 6 + 6 c 3 6 2 + - • ■ + 
(—l) n+ 1 6 c n 6 n-1 = 1 — cb+c 2 b 2j r- • ■ + ( — \) n+l c n ~ l 6 n-1 + bc—c 2 b 2j t- c 3 6 3 + - —f 
(—l) n+ 1 c n 6 n = 1. Similarly, d(a T b) — 1. Hence, a + b is invertible. 


Consider the ring M 2 (Z). Let a 


0 -1 

-1 0 


and b = 


0 1 
0 0 


. Then 


a is invertible and b is nilpotent. Now a + b = 


0 0 

-1 0 


Clearly a + b is a 


nonzero nilpotent element. Hence, a + b is not invertible. 
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10.1.2 Exercises 

1 . In the rings Zg and Zq, find the following elements: 

(i) the units, (ii) the nilpotent elements, and (iii) the zero divisors. 

2. Let R be the set of all 2 x 2 matrices over the field of complex numbers 


of the form 


, where 2 : denotes the complex conjugate of the 


21 22 

~Z2 ^1 

complex number 2 . Show that ( R , +, •) is a division ring, where + and • 
are the usual matrix addition and matrix multiplication, respectively. Is 
R a field? 

3. Let R be a ring with 1. Prove that 

(i) (— l)a = —a = a(~ 1 ) and (— 1 )(— 1 ) = 1 , 

(ii) if a is a unit in R, then —a is a unit in R and (—a ) -1 = —(a -1 ). 

4. Prove that a ring R is commutative if and only if (a 4- b ) 2 = a? + 2 ab 4 - b 2 
for all a,b £ R. 

5. Prove that a ring R is commutative if and only if a 2 — b 2 = (a + b) (a — b ) 
for all a,b £ R. 

6 . Let R be a ring. If a 3 = a for all a £ R, prove that R is commutative. 

7. Let R be a commutative ring and a,b £ R. Prove that for all n £ N, 


n 


(a 4- b) n — a n + [ ^ ) a n + ■ • • + l la 


n 


n 


n ~ r b r + •••+( \ab n ~ 1 +b n . 

r 1 \ n — 1 


8 . If a and b are elements of a ring and m and n are integers, prove that 

(i) ( na){mb ) = ( nm)(ab ), 

(ii) n(ab ) = ( na)b = a(nb ), 

(iii) n(— a) = (—n)a. 

9. If R is an integral domain of prime characteristic p , prove that (a + b) p = 
a p 4 - bP for all a, b £ R. 

10. Let R be a ring with 1 and without zero divisors. Prove that for all 
a, b £ R, ab = 1 implies ba = 1. 

11. Let R be a ring with 1. If a is a nilpotent element of i?, prove that 1 — a 
and 1 + a are units. 

12. Let R be a division ring and a,b £ R. Show that if ab = 0, then either 
a — 0 or b = 0. 
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13. Let a G R be an idempotent element. Show that (1 — a)ba is nilpotent 
for all b G R. 

14. Find all idempotent elements of the ring M 2 (R). 

15. Let R be a ring with 1. Let 0 ^ a G R. If there exist two distinct elements 
b and c in R such that ab — ac = 1, show that there are infinitely many 
elements x in R such that ax = 1. ( American Mathematical Monthly 
70(1961) 315). 

16. Let R be an integral domain and a, 5 G R. Let m,n 6 Z be such that m 
and n are relatively prime. Prove that a m — b m and a n = b n imply that 
a — b. 

17. Let R and R! be rings. Define + and • on R x R' by for all (a, 6), (c, d ) G 
RxR' 

(a, 6) + (c, d) — (a + c, b + d) and (a, b) ■ ( c , d) = (a • c,b - d). 


(i) Prove that (ft x ft', +, •) is a ring. This ring is called the direct sum 
of R and R' and is denoted by R 0 R'. 

(ii) If R and R' are commutative with identity, prove that R © R' is 
commutative with identity. 

18. Extend the notion of direct sum in Exercise 17 to any finite number of 
rings. 

19. Prove that the characteristic of a finite ring R divides \R\ . 

20. Let R be a ring with 1. Prove that the characteristic of the matrix ring 
M 2 (-ft) is the same as that of ft. 

21. If p is a prime integer, prove that (p — 1)! —1. 


22. In the following exercises, write the proof if the statement is true; other¬ 
wise, give a counterexample. 

(i) In a ring ft, if a and b are idempotent elements, then a + 6 is an 
idempotent element. 

(ii) In a ring ft, if a and b are nilpotent elements, then <2 + 6 is a nilpotent 
element. 

(iii) Every finite ring with 1 is an integral domain. 

(iv) There exists a field with seven elements. 

(v) The characteristic of an infinite ring is always 0. 
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(vi) An element of a ring R which is idempotent, but not a zero divisor, 
is the identity element of R. 

(vii) If a and b are two zero divisors, then a + b is also a zero divisor in 
a ring R. 

(viii) In a finite field F, a 2 + b 2 = 0 implies a — 0 and b = 0 for all 
a, b £ F. 

(ix) In a field F, (a + 6) _1 = a~ 1 + 6 _1 for all nonzero elements a , b such 
that a + b ^ 0 . 

(x) There exists a field with six elements. 


10.2 Some Important Rings 

In this section, we introduce two important rings and study some of their basic 
properties. 


10.2.1 Boolean Rings 


We recall that in Worked-Out Exercise 1 (page 279), an element x of a ring 
R is called an idempotent element if x 2 = x. The zero element and identity 
element of a ring are idempotent elements. In the ring Z, the only idempotent 
elements are 0 and 1. There exist rings, which contain idempotent elements 


different from 0 and 1. For example, in M 2 (Z), 


1 0 
2 0 


is an idempotent 


element. 


Definition 10.2.1 A ring R with 1 is called a, Boolean ring if every element 
of R is an idempotent. 

Example 10.2.2 (i) Z 2 is a. Boolean ring. 

(ii) The ring V(X) of Example 10.1.28 is a Boolean, ring since for all A £ 
V{X),AGA = A. 

Theorem 10.2.3 Let R be a Boolean ring. Then the characteristic of R is 2 
and R is commutative. 


Proof. First we show that R is of characteristic 2. Let x £ R. Now x + x = 
(.x + x ) 2 = (x + x)( 2 : + a;) = x{x + x) -\-x(x + x) = x 2 -\-x 2 -\-x 2 -\-x 2 = x + x + x + x. 
This implies that 2x — 4r and so 0 = 2x. Hence, 2-1 = 0 since x was arbitrary. 
It follows that the characteristic of R is 2 by Theorem 10.1.29. To show R 
is commutative, let x,y £ R. Then x + y = (x + y) 2 = {x + y)(x + y) = 
x 2 + xy + yx + y 2 = x + xy + yx + y. This implies that 0 = xy + yx. Hence, 
xy — xy + 0 = xy + xy + yx or xy = 2xy + yx = yx since 2 xy = 0. Thus, R is 
commutative. ■ 
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10.2.2 Regular Rings 

An element z of a ring R is called a regular element if there exists y E R 
such that x — xyx. 

Definition 10.2.4 A ring R is called a regular ring if every element of R is 
regular. 

In the ring Z, the only regular elements are 0,1, and — 1 . Thus, Z is not a 
regular ring. 

Example 10.2.5 Let R be a division ring and x E R. If x = 0, then x — xxx. 
Suppose i/O. Then zz "* 1 = 1 and so x = xx~ l x. Thus, R is a regular ring. 

From the definition of a Boolean ring, it follows that every Boolean ring is 
a regular ring. The field R is a regular ring, but not a Boolean ring. 

Example 10.2.6 Consider R, the fi.eld of real numbers a.nd. 

R x R ={(x,y) | x,y E R}. 

Define + and • on R x R by 

(x, y) + (z, w) = ( x + z,y + w ) 

{x,y)-(z,w) = ( xz,yw) 

for all x,y,z,w E R. Then R x R is a, commutative ring with identity. Now 
(1,0), (0,1) € RxR and (1,0) (0,1) = (0,0). This shows that RxR con¬ 
tains zero divisors and so R x R is not a field. We claim, that RxR is 
regular. Let (x,y) E R x R. If x - 0 = y, then (x,y)(x,y)(x,y) = (x,y). 
If x 0 and y ^ 0, then (z, y)(z _1 , y~ l ){x, y) = (x,y). If x - 0, but y 
0, then (x,y)(x,y~ l )(x,y) = (x.y). Similarly, if x ^ 0 and y = 0, then 
(z, y){x~ l , y)(x, y) = (z,y). Thus, in any case, (x,y) is a regular element. 
Hence, RxR is a regular ring. 


Example 10.2.7 Let M 2 (R) be the set of all 2 x 2 m, a trices over R. Now 
M 2 (R) is a noncom,mutative ring with 1, where + and, ■ are the usual matrix 
addition and multiplication, respectively. We show that M 2 (R) is a, regular 

ring. Let A— ^ E M 2 (R). 


ABA. 

Case 2: xw — zy = 0. 


Case 1: xw - zy 0. Then B = xw T z zy xw fi zy E M 2 (R) and A = 


xw—zy xw—zy 


“PilKc. 7feaiAe»tatLca/ 



10.2. SOME IMPORTANT RINGS 


286 


Subcase 2a: x,y,z,w are all zero. In this case, A = 


0 0 
0 0 


and so for 


any B £ M 2 (R), ABA = A. 

Subcase 2b: x,y,z,w are not all zero. Suppose x ^ 0 and let B = 

i o 


x 

0 0 


Then 


ABA = 


x 

z 

1 

Z_ 

X 


y 

w 

0 

0 



" £ 0 ’ 


* 


0 0 


z w 


H 



2: W 


J n 

r 

“1 


x y 


X 

y 

z ^ 

X 


z 

w 


since xw — zy = 0 and, i / 0 implies w = ^. If y ^ 0, then let B = ^ 

. v 

Then 


ABA = 

r 

X 

Z 

6 «£ 

_ 1 


' 0 

1 

y 

1- 

O O 


X 

z 

y 

w 


1 

0 ' 


X 

y 





w 

y 

0 


z 

w 





X 

y 



X 

y 



wx 

. v 

w 



z 

w 



Similarly, if z 7 ^ 0 or w 7 ^ 0 , then we can find, B such that ABA = A. 
M 2 (R) is a regular ring. 


0 

0 


Thus, 


Since M 2 (R) is not a division ring, it follows that a regular ring need not be 
a division ring. However, a division ring is a regular ring as shown in Example 
10.2.5. In the next theorem, we show that a regular ring under a suitable 
condition becomes a division ring. 


Theorem 10.2.8 Let R be a regular ring with more than one element. Suppose 
for all x £ R, there exists a unique y £ R such that x = xyx. Then 

(i) R has no zero divisors, 

(ii) if x 7 ^ 0 and x — xyx , then y = yxy for all x,y £ R, 

(Hi) R has an identity, 

(iv) R is a, division ring. 


Proof. (i) Let x be a nonzero element of R and xz = 0 for some z £ R. 
Now by hypothesis, there exists a unique y £ R such that xyx = x. Thus, 
x(y — z)x = xyx — xzx ~ xyx. Hence, by the uniqueness of y, y — z = y and so 
z — 0. This proves that R has no zero divisors. 
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(ii) Let x 7 ^ 0 and xyx = x. Then x(y — yxy) = xy — xyxy — xy — xy = 0. 
Since R has no zero divisors and x ^ 0, y — yxy = 0 and so yxy = y. 

(iii) Let 0 7 ^ x £ R. Then there exists a unique y £ R such that xyx = x. 
Let e = yx. If e = 0, then x — xyx = 0, which is a contradiction. Therefore, 
e / 0 . Also, e 2 = yxyx — y(xyx) = yx = e. Let z € R. Then (ze - z)e = 
ze 2 — ze = ze — ze = 0. Thus, by (i), either ze — z ~ 0 or ze — z. Similarly, 
e(ez — z) — 0 implies that ez = z. Hence, e is the identity of R. 

(iv) By (iii), R contains an identity element e. To show R is a division ring, 
it remains to be shown that every nonzero element of R has an inverse in R. 
Let i be a nonzero element in R. Then there exists a unique y £ R such that 
xyx = x. Thus, xyx = xe, i.e., x(yx — e) = 0. Since R has no zero divisors and 
x =4 0, yx — e — 0 and so yx = e. Similarly, xyx — ex implies xy — e. Therefore, 
xy — e — yx. Hence, R is a division ring. 


10.2.3 Exercises 

1. Prove that a Boolean ring R is a field if and only if R contains only 0 
and 1 . 

2. Prove that a ring R with 1 is a Boolean ring if and only if for all a, b £ R, 
(<a + b)ab — 0 . 

3. Let R be a Boolean ring with more than two elements. Find all zero 
divisors of R. 

4. Let T — {/ | / : R —»■ Z 2 }. Define + and • on T by for all /,g £ T, 
(/ + g)( x ) = f( x )+ g( x ) and (fg)( x ) = f( x )g( x ) for all X £ R. Show 
that (T, + , •) is a Boolean ring. 

5. Prove that a nonzero element of a regular ring with 1 is either a unit or 
a zero divisor. 

6 . Prove that the center of a regular ring is regular. 

7. Let R be a ring in which each element is idempotent. Let R = R x Z 2 . 
Define + and ■ on R by (a, [n]) + (b, [to]) = (a + b, [n + to]) and (a, [n]) - 
(b, [to]) = ( na + mb + ab, [nm]) for all (a, [n]), (6 , [to]) £ R. Show that + 
and • are well defined on R and R is a Boolean ring. 

8 . Let R be a regular ring with 1. 

(i) Prove that for any a £ R, there exists an idempotent e £ R such that 
Ra — Re. 

(ii) Prove that for any two idempotents e, / £ i?, there exists an idem- 
potent g £ R such that Re + Rf = Rg. 
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William Rowan Hamilton (1805- 
1865) was born on August 4, 1805, in Dublin, 
Ireland. He was the fourth of nine children. 
His early education from the age of three 
was provided by his uncle. By the age of 
five, he was proficient in Latin, Greek, and 
Hebrew. 

Hamilton started reading Newton’s Prin- 
cipia when he was about 15 and became in¬ 
terested in astronomy. In 1822, he discov¬ 
ered an error in Laplace’s Mecanique celeste , 
which was conveyed to John Brinkley thro¬ 
ugh a friend. Brinkley later helped Hamil¬ 
ton in getting appointed as his successor at Dunsink Observatory. 

On April 23, 1827, while still an undergraduate at Trinity College, Hamilton pre¬ 
sented his paper, “Theory of Systems of Rays,” to the Royal Irish Academy. This paper 
is responsible for creating the field of mathematical optics. Hamilton introduced the 
characteristic function, his first discovery. On June 10, 1827, he was appointed as¬ 
tronomer royal at Dunsink Observatory and professor of astronomy at Trinity College, 
even though he did not have a degree. 

Hamilton’s major contributions were in the algebra of quaternions, optics, and 
dynamics. He gave few examples to illustrate his concepts and so his papers were hard 
to read. He spent most of his life on the study of quaternions. 

Hamilton was interested in three-dimensional complex numbers, which he called 
“triplets.” He had little success in this area, as he was able to add, but could not find 
a suitable multiplication rule. He then considered the so-called quaternions. While he 
was walking along the Royal Canal on October 16, 1843, the discovery of the quater¬ 
nions flashed in his mind. He immediately scratched the multiplication formula for the 
quaternions on the stone of a bridge over the canal. Hamilton discovered that he could 
give up the commutative law of multiplication and still have a meaningful algebraic 
system. The geometric significance of the quaternions was realized when Hamilton and 
Cayley independently showed that the quaternion operators rotated vectors about a 
given axis. In 1837, Hamilton corrected Abel’s proof of the impossibility of solving the 
general quintic equations. 

Hamilton’s name is associated with concepts such as Hamiltonian functions, Hamil- 
tonian-Jacobi differential equations, Hamiltonian path in graph theory, and the Cayley- 
Hamilton theorem in linear algebra. He coined the terms “vector,” “scalar,” and 
“tensor.” Hamilton died on September 2, 1865. 
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Chapter 11 

Subrings, Ideals, and 
Homomorphisms 


The most important substructure of a ring is a particular subset called an 
“ideal.” The term ideal was coined by Dedekind in honor of Rummer’s work 
on ideal numbers. This notion of Rummer and Dedekind was used to ob¬ 
tain unique factorization properties. Rummer introduced the idea of an ideal 
number in his work on Fermat’s last theorem. Noether followed with some 
important results on the theory of ideals. Some of her ideas were inspired by 
the work not only of Dedekind, but also of Kronecker and Lasker. 

11.1 Subrings and Subfields 

In this section, we introduce the idea of a subring of a ring. This concept is 
analogous to the concept of a subgroup of a group. 

Definition 11.1.1 Let (R, +, •) be a ring. Let R' be a subset of R. Then (R', 
+, •) is called a subring of (R, +, •) if ( R\ +) is a subgroup of (R, +) and 
for all x,y 6 Rx ■ y £ R'. 

Let (R' , +, •) be a subring of the ring (R, +, •). Since R! C R and since the 
associativity for - and the distributive laws are inherited, ( R y , +, -) is itself a 
ring. We will usually suppress the operations + and ■ and call R! a subring of 
R. When R! and R are fields, R' is called a subfield of R. 

The following theorem gives a necessary and sufficient condition for a subset 
to be a subring. With these conditions it is easy to verify whether a nonempty 
subset of a ring is a subring or not. 

Theorem 11.1.2 Let R be a ring. A nonempty subset R! of R is a subring of 
R if and only if x — y £ R' and xy £ R' for all x,y £ R!. 
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Proof. First suppose that R! is a subring of R. Then R r is a ring and so for 
all x, y G R, x — y, xy G R'. Conversely, suppose x — y G R' and xy G R' for 
all x,y G Since x — y G R ; for all x,y £ R!, (R !, +) is a subgroup of (R, +) 
by Theorem 4.1.3. By the hypothesis, xy G R! for all x,y G R! . Hence, R! is a 
subring of R. ■ 

Example 11.1.3 (i) The ring E of even integers is a subring of Z. E is without 

1 . 

(ii) Consider the subset Es = {[0], [2], [4], [ 6 ]} of Z%. Then Es is a subring 
of Zg. Hence , Es is commutative. However, Es has no identity and Eg does 
have zero divisors, nam.ely, [2], [4], and. [ 6 ]. 

Example 11.1.4 Let Qz = {( 01 , 02 , 03 , 04 ) | a, G Z, i = 1,2,3,4}. Define + 
and - on Qz as in Exam,pie 10.1.15. Since the difference a,nd product, of integers 
is an integer, we h,a,ve 

( 01 , 02 , 03 , 04 ) — ( 61 , 62 , 63 , 64 ) G Qz 


and 

(01,02,03,04) • (61,62,63,54) G Qz 

for all (ai, 02,03,04), (61,62, 63, 64) G Qz- Hence, Qz is a subring of Qr. We 
note that Qz is noncomm.uta.tive, has an identity, and is without zero divisors. 
Now (0,2,0, 0) G Qz and (0,2,0, 0) _1 — (0, — ^,0,0) ^ Qz- Thus, Q z is not a 
division ring. 

Example 11.1.5 Set Qe = {(ai, 02,03, 04) | a* G E, i = 1,2,3,4}. Defi,ne T 
and ■ on Qe as in Example 10.1.15. Since the difference and product, of even 
integers is an even integer, we fi,nd that Qe is a subring of Qz- In fa.ct, Qe is 
a noncom,mutative ring without identity and, without zero divisors. 

Example 11.1.6 Consider the ring M2(Z) of Example 10.1.4. Let M2(E) de¬ 
note the set of all 2x2 matrices with entries from E. Since the sum, difference, 
and, product of even integers is an even integer, it follows that M 2 (E) is a sub¬ 
ring of M 2 (Z). Also, M 2 (E) is a noncom,mutalive ring without identity and 
with zero divisors. 


Following along the lines of Theorem 11.1.2, we can prove the next theorem. 
We leave its proof as an exercise. 

Theorem 11.1.7 Let F be a field. A nonempty subset S of F is a, subfi.eld, of 
F if and only if 

(i) S contains mure than one elem.ent, 

(ii) x — y,xy G S for all x,y G S', and 
(Hi) x~ 1 G S for all x € S, x 0. ■ 
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Example 11.1.8 Q and Q[\/2] = {a + by/2 \ a, b E Q} are subfields of Li (see 
Worked-Out Exercise 4 below). 

Theorem 11.1.9 Let R be a, ring (fi,eld) and, {Ri | i E A} be a nonempty 
family of subrings (subfields) of R. Then C\ l( z/ v R l is a subring (subfield) of R. 

Proof. Since 0 E for all i E A, 0 E D^^Ri and so Cl^Ri 4>- Let 
x,y E r\i € \Ri. Then x,y E Ri for all i £ A. Since each R z is a subring, 
x — y, xy E Ri for all i G A. Hence, x — y,xy E PI^a Ri- Thus, Pi t€ A Ri is a 
subring of R. 

Similarly, if each Ri is a subfield of the field R , then D^A-Ri is a subfield of 

R.m 

It is interesting to note that the intersection of all subfields of R is Q. 

11.1.1 Worked-Out Exercises 

0 Exercise 1 Let X be an infinite set. Then (P(X), A, Pi) is a ring with 1 . 
Let 

R = {A E V(X) | A is finite}. 

Prove the following assertions. 

(i) R is a subring of V(X). 

(ii) R is without identity. 

(iii) For all A E R, A f>, A is a zero divisor in R. 

(iv) For all A E P(X), A -=fi X, A <f>, A is a zero divisor in V(X). 

Solution: (i) Since 6 is finite, <f> E R and so R is nonempty. Let A,B E R. 
Then A and B are finite and so AnB is finite. Now AAB = (ylUH)\(^4nH) 
and so AAB is finite. Therefore, AAB, A Pi B E R. Thus, R is closed under 
the operations A and fl. Now it is easy to verify that ( R , A, Pi) is a subring. 

(ii) Suppose R has an identity, say, E. Then E is finite. Since X is infinite, 
there exists a E X such that a E. Now {a} E R. Thus, {a} = E Pi {a} = <f, 
which is a contradiction. Hence. R has no identity. 

(iii) Let A E R and Afio. Since A is finite and X is infinite, there exists 
iGl such that x A. Now {2:} E R. Since A PI {r} = fi, A is a zero divisor. 

(iv) Let A E V(X) be such that A X and A <f>. Then there exists 
x E X such that x £ A. Hence, A Pi {x} = <f and so A is a zero divisor. 

Exercise 2 Let R be a ring such that a 2 + a is in the center of R for all a E R. 
Show that R is commutative. 
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Solution: Let x,y E R. Then (x + y ) 2 + x + y E C(-R), i.e., x 2 + xy + yx+ 
y 2 + x+ y E C(i?). Since z : 2 + x, y 2 +y E C(.R) and C(i?) is a subring (Exercise 
14, page 294), xy + yx £ C(R). Therefore, x(xy + yx) — (xy + yx)x and 
so x 2 y + xyx = xyx + yx 2 . Thus, x 2 y — yx 2 . Now x 2 + x £ C(R ) and so 
y(x 2 + x) = (x 2 + x)y. Hence, yx 2 + yx — x 2 y + xy and so xy = yx, proving 
that R is commutative. 


0 Exercise 3 Find all subrings of the ring Z of integers. Find those subrings 
which do not contain the identity element. 

Solution: Let n be a nonnegative integer and T n — nZ = {nt \ t £ Z}. 
Since 0 £ T n , T n ^ 0. Let a — nt, b = ns be two elements in T n . Then 
a — b = nt — ns = n(t — s) £ T n and ab — ( nt){ns ) = n(t(ns )) £ T n . Hence, T n 
is a subring of Z. We now show that if A is any subring of Z, then A — T n for 
some nonnegative integer n. 

Let A be a subring of Z. If A — {0}, then A = 0 Z. Suppose A ^ {0}. 
Then there exists m £ A such that m ^ 0. Now —m £ A and so A contains a 
positive integer. By the well-ordering principle, A contains a smallest positive 
integer. Let n be the smallest positive integer in A. Then nZ CA. Let m £ A. 
By the division algorithm, there exist integers q and r such that m = nq + r, 
0 < r < n. Since n £ A, nq £ A. Hence, r — m — nq £ A. The minimality of 
n implies that r = 0 and so m = nq £ nZ. Thus, A = nZ. If n ^ 1, then nZ 
does not contain identity. 

0 Exercise 4 Show that Q[V2] = {a + by/2 £ R | a, b £ Q} is a subfield of 
the field R. 


Solution: Since 0 = 0 + 0\/2 £ Q[\/ 2 ], Q[\/2] / 4 >. Let a + by/ 2 , c + dy/2 
£ Q[\/2]. Then 

(a + by/ 2) — (c + dy/ 2) = (a — c) + (b — d)y/2 £ Q[\/ 2 ] 


and 

(a + by/ 2) (c + dy/ 2) = (ac + 26d) + (ad + 6c)\/2 £ Q[\/2]. 

Now 0 + Oy/2 and 1 + Oy/2 are distinct elements of Q[\/2]- Therefore, Q(\/2] 
contains more than one element. Let a + by/2 be a nonzero element of Q[\/ 2 ]. 
Then a and b cannot both be zero simultaneously. We now show that a — by/2 ^ 
0. Suppose a — by/ 2 = 0. Then a = by/2. If b = 0, then a = 0. Therefore, both 
a and b are zero, a contradiction. If 6 / 0, then y/2 = f £ Q, a contradiction. 
Hence, a — by/2 / 0. Similarly, a + by/2 ^ 0. Thus, a 2 — 2 b 2 = (a + by/2) (a — 
by/ 2) / 0. Now 


1 


(a — by/2) 


a 


a + by/2 cl 2 — 2 b 2 


a 


2 _ 


■y/2 £ Q[y/2\. 


■J2b 2 a2-2&2 v “^ LV 7 

ritte mo^Lc#fta£uia/ r4y.Ai.cJ. 
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Since (a + = 1 > (a + by/2) 1 exists in Q[\/2]. Thus, we find that 

Q[\/ 2 ] is a subfield of R by Theorem 11.1.7. 


11.1.2 Exercises 

1. Prove the following the statements. 

| a, b, c G zl is a subring of M 2 (Z). 


(i) Ti = 

(ii) T 2 = 

(iii) T 3 = 

(iv) T 4 = 


a b 
0 c 


a b 
—b a 

a 0 
0 a 

a b ] 
0 a 


| a, b € Z j is a subring of M 2 (Z). 
a e z| is a subring of M 2 (Z). 
a, b € zl is a subring of T\. 


2. In the ring Z of integers, find which of the following subsets of Z are 
subrings. 

(i) The set of integers of the form 4k + 2 , k e Z. 

(ii) The set of integers of the form 4k 4 - 1 , k e Z. 

(iii) The set of integers of the form 4k , k E Z. 

3. Show that T = {[0], [5]} is a subring of the ring Z 10 . 

4. Let 1? be a ring with 1. Show that the subset T = {nl | n (E Z} is a 
subring of R. 


5. Let R be a ring and n be a positive integer. Show that the subset T = 
{a £ R | na = 0} is a subring of R. 


6 . Show that T — 


a by /3 
—by/ 3 a 


a, b € R 


is a subring of M 2 (R). 


7. 

8 . 

9. 


Show that Q[\/3] and Q[\/5] are subfields of the field R, but Z[y/2] = 
{a + by/ 2 | a, b e Z} is not a subfield of R. 

Show that Q(i) = {a -f bi \ a. b G Q} is a subfield of C, where i 2 = — 1 . 


Show that F = 
field? 


a —b 
b a 


a. 


b£ Z F 


is a subring of M 2 ( Z 5 ). Is F a 


10. Let w be a root of x 2 + x + 1 = 0. Prove that T = {a + bu> j a, b e Q} is 
a subfield of the field of complex numbers. 

“Pi“PlLy-S/LcS- 
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11. Let F be a field of characteristic p > 0. Show that T = {a E F \ a? — a} 
is a subfield of F. 

is a subring of M 2 (Z). Also, 
show that every nonzero element of T is a unit in Af 2 (R). 

13. Let R be a commutative ring. Show that the set 

T = {r E R | r n = 0 for some integer n} 
is a subring of R. 

14. Prove that C(jR) is a subring of R and that C(R) is commutative. 

15. Let e be an idempotent of a ring R. Prove that the set 

eRe = {ere \ r E R} 

is a subring of R with e as the identity element. 

16. Find the center of the ring M 2 (R). 

IT. Prove that the characteristic of a subfield is the same as the characteristic 
of the field. 

18. Find all subrings with identity of the ring Z\q. 

19. Find all subfields of the field Z p , p a prime integer. 

20. Let R be a ring without any nonzero nilpotent elements. Show that 
( ara — ra) 2 = 0 for all r E R and for all idempotent elements a E R. 
Hence, show that C(R) contains all idempotent elements. 

21. Let C=:{/:R—►R|/is continuous on R}. Define + and • on C by 

(f + g)(x) = f(x)+g(x), 

(f-g){x) = f{x)g(x) 

for all f,g £ C and for all x 6 R. 

(i) Show that C is a ring. 

(ii) Let D = {f E C \ f is differentiable on R}. Show that D is a subring 
of C. 

22. Let R be a ring and / : R —» [0,1] be such that 

f(a~b ) > min{/(a),/(&)}, 

f(ab) > min{/(a), f(b)} 

for all a,b E R. Prove that for all t E 2 '(f), Rt = {x E R \ f(x) > i} is a 
subring of R. 

~Pu*uc- 


12. Prove that T = 


x + y 

-y 


y 

X 


x,y EZ 
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23. In the following exercises, write the proof if the statement is true; other¬ 
wise, give a counterexample. 

(i) The union of two subrings of a ring is a subring. 

(ii) The identity element of a subring is always the identity element of 
the ring. 

(iii) Q is the only subfield of the field R. 

(iv ) Q[\/3] = { a + 6\/3 | a, b 6 Q} is the intersection of all subfields of R 
which contain y/3. 

(v) The set Z of integers is a subring of the field of real numbers. 

(vi) Every additive subgroup of Z is a subring of Z. 


11.2 Ideals and Quotient Rings 

In this section, we introduce the notions of ideals and quotient rings. These 
concepts are analogous to normal subgroups and quotient groups. 

The very famous problem called “Fermat’s last theorem” led to the inven¬ 
tion of ideals. Fermat (1601-1665) jotted many of his results in the margin 
of Diophantus ’ Arithmetica. For this particular “theorem,” Fermat wrote that 
he discovered a remarkable theorem whose proof was too long to put in the 
margin. The theorem is stated as follows: If n is an integer greater than 2, 
then there exist no positive integers x, y, z such that x n + y n = z n . However, no 
one was able to prove this result until recently; in 1994, Andrew Wiles found 
a proof after many years of work. 

In 1843, Kummer (1810-1893) thought that he had found a proof of Fer¬ 
mat’s last theorem. However, Kummer had incorrectly assumed uniqueness of 
the factorization of complex numbers of the form x -I- A y, where A p = 1 for p 
an odd prime. Dirichlet (1805-1859) had made an incorrect assumption about 
factorization of numbers. Kummer continued his efforts to solve Fermat’s last 
theorem. He was partially successful by introducing the concept of “ideal num¬ 
ber.” Dedekind (1831-1916) used Kummer’s ideas to invent the notion of an 
ideal. Kronecker (1823-1891) also played an important part in the development 
of ring theory. 


Definition 11.2.1 Let R be a ring. A nonempty subset I of R is called a left 
(right) ideal of R if for all a. b € I and for all r € R, a — b G I , ra £ I 
(a — b £ I, ar £ /). 


A nonempty subset I of a ring R is called a (two-sided) ideal of R if I is 
both a left and a right ideal of R. 

From the definition of a left (right) ideal, it follows that if I is a left (right) 
ideal of f?, then I is a subring of R. Also, if R is a commutative ring, then 





11.2. IDEALS AND QUOTIENT RINGS 


296 


every left ideal is also a right ideal and every right ideal is a left ideal. Thus, 
for commutative rings every left or right ideal is an ideal. 

By Theorem 11.1.2, it is clear that a nonempty subset I of a ring R is an 
ideal if and only if (/, +) is a subgroup of (R, +) and for all a £ I and for all 
r £ R, ar and ra £ I. 

Example 11.2.2 Let R be a ring. The subsets {0} and R of R are (left, right) 
ideals. These ideals are called trivial ideals. All other (left, right) ideals are 
called nontrivial. 

An ideal / of a ring R is called a proper ideal if / 7 ^ R. 

Example 11.2.3 Letn £ Z and I = {nk \ k € Z}. As in Worked-Out Exercise 
3 (page 292 ), I is a subring. Also, for all r £ Z, ( nk)r — n(kr) £ I a,nd r(nk ) — 
n{rk ) £ I. Hence, I is an ideal of Z. 

Next, we give an example of a ring in which there exists a left ideal which 
is not a right ideal, a right ideal which is not a left ideal, and a subring which 
is not a left (right) ideal. 

Example 11.2.4 Consider the ring M 2 (Z). Let 

/, = {[; °] | <*,(> 6 z} , 
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proving that Ii is a left ideal of M 2 CZ 1 ). Now 
M 2 (Z), but 


'10' 


1 

0 

1 ’ 


01" 

1 0 


0 

_1 

0 


0 1 


0 

0 


e h 




and, 


0 1 
0 0 


€ 


Hence, I\ is not a right ideal of M 2 (Z). Similarly, I 2 is a right ideal of M 2 CZ 1 ), 
but not a left ideal, I 3 is an ideal of M 2 (Z), and I 4 is a subring, but not an 
ideal of M 2 CZI). 


We remind the reader to notice the similarity of the next few results with 
corresponding results in linear algebra and group theory. 

Theorem 11.2.5 Let R be a, ring and {I a | a G A} be a nonem.pty collection 
of left (right) ideals of R. Then Pi a pA^a ? ' 5 a left (eight) ideal of R. 


Proof. Suppose [I a | a € A} is nonempty a collection of left ideals of R. 
Since 0 £ / Q for all a, 0 £ O a I a and so fl a I a ^ (j). Let a, & £ n Q / Q . Then 
a, b E I a for all a. Since each I a is a left ideal, a — b € I a for all a. Hence, 
a — b E n a I a . Let r £ Since each I a is a left ideal of R , ra E I a for all a 
and so ra E D a I a . Thus, fl a / a is a left ideal of R. Similarly, if {I a \ a E A} is 
a nonempty collection of right ideals of R , then fl a I a is a right ideal of R. ■ 

Let ai, <22,..., E R. Then by the notation Yn=i a i, we mean the sum 
o-i T a 2 T • ■ • + a n . 


Definition 11.2.6 Let S be a nonempty subset of a ring R. Define {S) l to be 
the intersection of all left ideals of R which contain S. Then the left ideal (S) l 
is called the left ideal generated by S. Similarly, we can define (S) r , the 
right ideal generated by S, and (S) , the ideal generated by S. 

Note that (S) l is the smallest left ideal of R which contains S. 

Theorem 11.2.7 Let R be a ring and S be a nonempty subset of R. Then 

fi) 

(S)i = {£?=1 nsi + J2 l j=i n j s j | Vi e R, n 3 G Z, s i} Sj G 5, 

1 < i < k,l < j < l,k,l G N}. 


{ S) r = {ELi s i r i + Ej =1 n j s j \ n e R, n 3 e Z, si, sj e S, 
1 < i < fc, 1 < j < l, k, l £ N}. 
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Proof. (i) Let 

A = {Eti r iSi + Ej=i n j s j | r % E R,rij E Z, s*, € 5, 

1 < i < fc, 1 < j < /, /c, / E N}. 

Since (R); is the intersection of all left ideals of R which contain S', we have 
(S'); D S. Also, since (S); is closed under addition and closed from the left 
under multiplication by elements of R , we have A C ( S) l . We now show that 
A is a left ideal of R such that ADS. Then A D (S) l since (S) L is the 
smallest left ideal of R containing S. Let s E S. Then s — 0 • s + Is E A and 
so S C A. Let Yli=i r iSi + Ej=i n j s 'j an< ^ ELi ^ s 7 + e A. Then 

= (EjLi^+E<=i(-^) d )+ 
(Ej=i n j s j + EJEi(—rij)s') E A. Let r E R. Then r(E)f =1 W + Ej-=i n j s j) = 
Ef=i( r 7 ’z) 5 i + Ej=i( n ; r ) s j ^ A. Hence, A is a left ideal of R. 

(ii) The proof is similar to (i). ■ 


Corollary 11 . 2.8 Let R be a ring and S be a nonempty subset of R. If R is 
with 1 , then 

fi) 

k 

(S), = {E r * Si | ri E R, Si E S, 1 < i < k, n E N}. 

i= l 

(ii) 

k 

<S>, = { 2 > ri | Ti E R, Si E S, 1 < i < A;, n E N}. 

i=l 


Proof. (i) Clearly (S); D {E?=i DSi | E -R, E S}. Let Ei=i W + 
Ej=i n j s j- £ (S); . Since R has an identity 1, njSj = (njl)s' and rijl E R. Thus, 

ELi usi + Ei=i n j s 'j = Eki ns* + Ej=iK-i)«j e {Eti I n e R, Si e S, 

1 < i < A:, n E N}. Hence, {S) l C {E;=i ^s* | 
n E N}. 

(ii) The proof is similar to (i). ■ 


E R, 5^ E S, 1 < i < fc. 


If S = {ai,a 25 • • •, a n }, then the left ideal (S) l generated by S is denoted 
by (ai, 02 ,..., a n ) l . In this case, we call (S) l a finitely generated left ideal. 

Similar terminology is used for (S) r and (S). If S — {a}, then ( a) l is called the 
principal left ideal generated by a, ( a) r is called the principal right ideal 
generated by a, and (a) is called the principal ideal generated by a. 

Corollary 11.2.9 Let R be a ring and a E R. 

(i) Then 

( a) i = {ra ~\-na \ r £ R,n £ Z >- _ 

Pitte 7feafAe»tatL<ra/ P4y.i.ZcJ. 
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(ii) If R is with 1, then 


(a) i = {ra\r e R}. 

Proof. (i) This assertion follows from the equality 

km k m 

Da + n o a = r *) a + (S n 3 ) a - 

i= 1 j =1 i— 1 j —1 

(ii) This follows from (i) and Corollary 11.2.8. ■ 

Similarly, we can prove that (a) r — {ar 4- na \ r £ R, n £ Z} and (a) = 
{■ra 4 - as + na + Ya =i r t as i I r ? s > £ R, n £ Z,1 < i < k, k £ N}. 

Consider the subsets Ra = {ra \ r £ R} and aR = {ar j r £ i?} of R. If R 
is without identity, then Ra {aR) is still a left (right) ideal of R (Exercise 4, 
page 306). It is not necessarily the case that a £ Ra {a £ aR) as illustrated by 
the next example. 

Example 11.2.10 Consider the ring E of even integers. E does not have an 
identity. (2) = {r2 + n2 | r £ E, n £ Z} = {0, ±2, ±4,...} and 2 £ (2) . 
However, {r2 | r £ E} = {0, ±4, ± 8 ,.. which does not contain 2. 

In the next theorem, we obtain a necessary and sufficient condition for a 
ring with 1 to be a division ring. 

Theorem 11.2.11 Let R be a ring with 1. Then R is a division ring if and 
only if R has no nontrivial left ideals. 


Proof. Suppose R is a division ring. Let I be a left ideal of R such that 
I D {0}. Then there exists a £ I such that a ^ 0 and since I is a left ideal, 
1 = a~ l a £ /. Hence, for all r £ R, r = rl £ 7, whence R = I. 

Conversely, suppose R has no nontrivial left ideals. Let a £ R and a ^ 0. 
Then (a) l — R and so 1 £ (a) ( . Now (a) l = {ra \ r £ i?}, whence there ex¬ 
ists r £ R such that 1 = ra. This implies that r ^ 0. Proceeding as in the 
case of the nonzero element a, we find that tr = 1 for some t £ R. Therefore, 
t = tl = t(ra) = ( tr)a = 1 a = a. Thus, ra = 1 = ar and so a is a unit. 
Consequently, every nonzero element of R is a unit. Hence, R is a division 
ring. ■ 


Following along the lines of the above theorem, we can prove that a ring R 
with 1 is a division ring if and only if R has no nontrivial right ideals. 

The following corollary is immediate from the above theorem. 

“Pi 
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Corollary 11.2.12 Let R be a commutative ring with 1. Then R is a field if 
and only if R has no nontrivial ideals. H 

Definition 11.2.13 A ring R is called a simple ring if R 2 ^ { 0 } and { 0 } 
and, R are the only ideals of R. 

Example 11.2.14 Every division ring is a simple ring. 

Example 11.2.15 In this example, we show that M 2 (R) is a simple ring. 
Let A be a nonzero ideal of M 2 (R). Then there exists a nonzero el.em.ent 

£ A. Now at least one of a , b, c, d is nonzero. Since A is an ideal 

, o o i r o 1 1 

and 0 ’ 0 0 ^ -^(R), we have 



' a b 1 [ 0 0 

c d 10 


b 0 
d 0 


£ A, 




11.2. IDEALS AND QUOTIENT RINGS 


301 


The above example shows that there are simple rings, which are not division 
rings. 

For aGii, aRa denotes the set { ara \ r £ R}. 

We now consider the sum and product of left (right) ideals. 

Let A and B be two nonempty subsets of a ring R. Define the sum and 
product of A and B as follows: 

A T B ~ {u T b | a £ A, b £ B j- 

AB = {a\bi + 0 . 2^2 + • ■ • T cinbn | a i £ A, bi £ B , i = 1, 2,..., n, n £ N}. 

Thus, AB denotes the set of all finite sums of the form Y! a ibi, £ A, 
bi £ B. 

Let n £ N. Inductively, we define 

A 1 = A, 

A n = AA n ~ l if n > 1. 

We now list some interesting properties of these two operations. 

Theorem 11.2.16 Let A, B , and C be left (right) ideals of a ring R. Then the 
following assertions hold. 

(i) A + B — B + A is a left (right) ideal of R. 

(ii) A + A = A. 

(in) {A 4- B) + C = A + (B + C). 

(iv) AB is a. left (right) ideal of R. 

(v) (AB)C = A(BC). ' 

(vi) If A,B and C are ideals, then A(B + C) = AB + AC, (B + C)A = 
BA + CA. 

(vii) If A is a right ideal and B is a left ideal, then AB C A fl B. 

(viii) R is a regular ring if and only if for any right ideal A and for any 
left, ideal B , AB = AoB. 

(ix) The set I(R) of all ideals of R form,s a m,od,ular lattice with respect to 
set inclusion as a partial ordering. 

Proof. We only prove (viii) and (ix) and leave the other properties as 
exercises. 

(viii) Suppose R is a regular ring. Let a £ A D B. There exists 6 E R such 
that a = aba. Since B is a left ideal and a £ B,ba £ B. Thus, a = a(ba) £ AB, 
whence A (1 B C AB. By (vii), AB C A n B. Consequently, AB = A fl B. 
Conversely, assume that AB = AD B for any right ideal A and left ideal B of 
R. Let a £ R and consider (a) r , the right ideal generated by a. Since (a) r is a 
right ideal, (a) r R C (a) r . Also, by our assumption (a) r D R = (a) r R. Hence, 

a £ (a) r n R- (a) r R. 
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Therefore, a = Ya =i a A f° r some a* G (a) r , 6* 6 B, i — 1, 2,..., n. From the 
statements following Corollary 11.2.9, a t = ati + n^a for some t{ £ R, rii € Z, 
z - 1, 2,... ,n. Thus, a = £)?=i a *6* = = a (E?=i(£A+ n ^)) £ 

aB. This implies that (a) r = aB. Since aB C (a) r , (a) r = aft!. Similarly, 
{a)^ — Ra. It now follows that a G aR fl Ra — ( aR)(Ra ) C aRa. Hence, there 
exists b G R such that a = aba , i.e., a is regular. Consequently, R is regular. 

(ix) By using arguments similar to the proof of Theorem 4.1.16, we can show 
that (/(B), C) is a poset. To show (/(B), C) is a lattice, let A, B G I(R). Now 
AflB, A + B G /(B). Also, A, B C A + B. Let C G I(R) he such that A, B C C. 
Since C is an ideal, A + B C C. Hence, A + B = A V B, the lub of {A, B}. 
Similarly, A fl B = A A B, the gib of {A, B}. Thus, /(B) is a lattice. To show 
(/(B), C) is a modular lattice, let A, B, C be three elements in /(B) such that 
ACC. Note that A V (B AC) = A + (B n C) and (A V B) A C = (A + B) D C. 
Now A + (B n C) C (A + B) fl C and so A V (B A C) C (A V B) A C. Let 
x G (A + B) fl C. Then x G C and x G A + B. Thus, x = a + b for some 
a € A C. C and b G B. This implies that 6 = x — a G C and so b G B fl C, 
which shows that x G A + (B fl C). Hence, (A + B) n C C A + (B fl C), i.e., 
(AvB)ACC Av(BAC). Thus, A V (B A C) = (A V B) A C. Consequently, 
/(B) is a modular lattice. ■ 

We now give the analogue of quotient groups for rings. Let B be a ring 
and / an ideal of B. Then (/, +) is a normal subgroup of (B, +) since the 
latter group is commutative. Hence, if Rjl denotes the set of all cosets r + I — 
{r + a | a G /} for all r G B, then (B//, +) is a commutative group, where 

(r + I) + (/ + I) ~ ir + r') +1 

for all r+I, r' + I G R/I. Now define multiplication on R/I by (r+I)- ( r'+I ) = 
rr' + I for all r + /, r' + / G B//. Then (R/I, +, •) forms a ring. We leave the 
details as an exercise. 

Definition 11.2.17 If R is a ring and I is an ideal of B, then the ring 
(B//, +, ■) is called the quotient ring of R by I. 

Theorem 11.2.18 Let n G Z be a fixed positive integer. Then the following 
conditions are equivalent. 

(i) n is prime. 

(ii) Z/ (n) is an integral domain. 

(Hi) Z / (n) is afield. 

Proof. (i) =Kii): Let a + (n ), b + (n) G Z/ ( n) . Suppose 

(a + (n))(b + (n)) = 0 + (n). 
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Then ab + (n) = 0 -fi (n) and so ab £ (n ). Thus, there exists r £ Z such that 
ab — rn. This implies that n\ab. Since n is prime, either n\a or n\b, i.e., either 
a £ (n) or b £ (n) and hence either a + (n) = 0 + (n) or b + (n) — 0 + (n) . 
This implies that Z/ (n) has no zero divisors, proving that Z/ (n) is an integral 
domain. 

(ii) =^(iii): Since Z/ { n) is a finite integral domain, the result follows from 
Theorem 10.1.23. 

(iii) =>(i): Suppose n is not prime. Then n = n,\n 2 for some 1 < n\ < n and 
1 < rj ,2 < n. Now Tii + (n) and 77,2 + {n) are nonzero elements of Z/ (n) and 

(ni + (n))(n 2 + (n)) = 77,1772 + { n) = n 4- {n) = 0 + { n) . 

Since Z/ in) is a field, Z/ (n) has no zero divisors. Thus, either ni + (n) = 
0 + (n) or 772 + i n ) = 0 + ( 77 ) , a contradiction. Therefore, n is prime. ■ 

Definition 11.2.19 Let I be an ideal of a ring R. 

(i) I is called a nil ideal if each element of I is a nilpotent element. 

(ii) I is called a nilpotent ideal if I n — {0} for som,e positive integer n. 

Example 11.2.20 In the ring Zs, the ideal I = {[0], [4]} is a nil ideal and 
also a nilpotent ideal. 1 2 = {Si=i[ a i][^] I £ I•, k £ N} = {0} since 

16| aib{. 

From the definition, it follows that every nilpotent ideal is a nil ideal. 
The following example shows that the converse is not true. In this exam¬ 
ple, we construct a ring R from the rings Z p n, n — 1,2,..., i.e., from the rings 
Z p , Z p 2 , Zp3,..., where p is a fixed prime. 

Example 11.2.21 Let p be a fixed prim.e. Let R be the collection of all se¬ 
quences {a n } such that a n £ Z p n (77 > 1) a.nd. there exists a, positive integer 
777 (dependent on {a n }) such that a n = [0] for all n > m. Define addition and 
multiplication on R by 

{a n } + {6 n } = {a n + 6 n }, 

for all {a„},{6 n } £ R. We ask the reader to verify that R is a commutative 
ring under these two operations, where the zero element is the sequence {a n } 
such that a n = [0] for all n and the additive inverse of the sequence {a n } is the 
sequence {— a n }. Now in Z p n, [p] is a nilpotent element since (p} n — \p n ] = [0]. 
Thus, for any [r] £ Z p n, [p][r] = (pr\ is a nilpotent element. Therefore, we find 
that each element of [p]Z p n is a nilpotent el.em.ent. 

Let 


I = {{[p]ai, [p]o 2 ,..., (p\a n , [0], [0],.. •] e R | 77 £ N, £ Z i,i = 1,... ,n}. 
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Then I is an ideal of R. Also, every el.em.ent of I is nilpotent. Let us now show 
that I is not nilpotent. Suppose I is nilpotent. Then there exists a positive 
integer m such that I m = {0}. Now the sequence {a n } such that a n = \p] for 
n — 1,2,..., m+1 and a n = 0 for all n > m + 2 is an el.em.ent of I. Then 
{a n } m = {[0], [0], ..., [0], [p m ], [0], [0 ],...}, where the (m + T)th term, of this 
sequence is [p m ] and all other terms are 0. Since [p m ] is not zero in Z p m+i, we 
find that {a n } m 0 and {a n } m G 7 m = {0}, a contradiction. This implies that 
I is not nilpotent. 

Theorem 11.2.22 Let R be a commutative ring with 1 a,nd. I denote the set 
of all nilpotent elements of R. Then 

(i) I is a nil ideal of R, 

(ii) the quotient ring Rfl has no nonzero nilpotent elements. 

Proof. (i) Since 0 G 7, I 7 ^ 4>. Let a, b G I. There exist positive integers m 
and n such that a n = 0 and b m — 0. Since R is commutative, we can write 

(a - b) n+m = a n+m + • • • 4- (-l) r ^ + m ^ja n+Tn - r b r + ■ • - + (-1 ) n +™b n+Tn . 

The general term of the above expression is (—l) r ( n ^ m )o 7l+m_7 '6 r , where 0 < 
r < m, + n. If r < m, then n + m — r > n and hence a n+Tn ~ r - a n a Tn ~ T — 0. 
Again, if r > m, then b T - 5 Tn +( 7 ' _m ) — ym^T-m _ q Therefore, we find that 
(_l)^n+m) a n+m-r 6 r = q, r = o, 1,2,... ,n + m. This implies that ( a-b) n+Tn - 
0, i.e., a —bis nilpotent and so a — b G /. Let r £ R. Then (ra) n = r n a n = r n 0 — 
0. Since R is commutative, (ar) n = ( ra) n = 0. Thus, ar , ra G /. Consequently, 
/ is an ideal of R. Since every element of I is nilpotent, I is nil. 

(ii) Let a + I be a nilpotent element of R/I. Then (a + I) n — I for some 
positive integer n. But a n +1 = (a + I) n . Thus, a n +1 = /, which implies that 
a n G I. Since every element of / is nilpotent, there exists a positive integer 
m such that (a n ) m = 0, i.e., a nTn = 0, which shows that a is nilpotent and so 
a £ I. This implies a + I = I. Hence, R/I has no nonzero nilpotent elements. ■ 

Theorem 11.2.23 Let A a.nd B be two nil ideals of a commutative ring R 
with 1. Then A + B is a nil ideal. 

Proof. By Theorem 11.2.16, we know that A + B is an ideal of R. Let I 
be the set of all nilpotent elements of R. Then A C I, B C I and by Theorem 
11.2.22, I is an ideal. Hence, A + B C I. Since 7 is nil, A + B is nil. ■ 


11.2.1 Worked-Out Exercises 
<0 Exercise 1 Find all ideals of Z. 
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Solution: From Worked-Out Exercise 3 (page 292), we know that the 
subrings of Z are the subsets nZ, n — 0,1, 2,.... Let us now show that these 
subrings are precisely the ideals of Z. If / is an ideal of Z, then / is a subring 
of Z and so / — nZ for some nonnegative integer n. Now, let I — nZ (n is a 
nonnegative integer). Then / is a subring. If r <E Z, then rl — r(nZ) =n(rZ) C 
nZ = I. Similarly, Ir C /. Hence, / is an ideal of Z. 

Exercise 2 Let R be a ring such that R has no zero divisors. Show that if 
every subring of R is an ideal of R , then R is commutative. 

Solution: Let 0 ^ a € R. Then C(a) — {x € R \ xa = ax} is a subring 
of R and hence an ideal of R. Thus, ra 6 C(a ) for all r 6 R. Let r € R. Now 
ara — ra 2 implies that (ar — ra)a = 0. Since R has no zero divisors and a^O, 
ar — ra = 0 and so ar = ra. Hence, a is in the center of R. Since a is arbitrary, 
R is commutative. 


0 Exercise 3 Give an example of a ring R and ideals A*, i € I, such that 
A t D Aj = {0} if i / j, but A t fi (Ej^ 2 Aj) ^ {0}. 

Solution: Let R = {0 ,a,b,cj. Define + and • on R by 

2a — 2b = 2c — 0, xy = 0, for all x,y 6 R and 
a-\-b=b + a — c, a-f-c = c + a = 6, and b + c — c + b — a. 

Then (ii,+,•) is a ring. Let A\ = {0, a}, A 2 = {0,6}, and A 3 = {0, c}. Then 
A\ + A 2 — Ai + ^.3 = A 2 + A 3 = R and A\ n A 2 — A\ fi A3 = A 2 n A3 = {0}. 

0 Exercise 4 Give an example of a ring R and ideals A and B such that 
AB C Afl B. 


Solution: Let R be the ring of Worked-Out Exercise 3. Let A = B = {0, a}. 
Then AB = {0} C {0, a} — A Pi B. 

0 Exercise 5 Characterize all commutative rings R such that R has only two 
ideals R and {0}. 


Solution: Let R be a commutative ring such- that the only ideals of R are 
R and {0}. Now R 2 is an ideal of R. Thus, R? = {0} or R? ~ R. 

Case 1. R 2 = r {0}. Then <26 = 0 for all a, 6 € R. In this case, every subgroup 
of (i?, +) is an ideal. Hence, ( R , +) has no proper subgroups and so ( R , +) is 
a cyclic group of prime order by Exercise 21 (page 138). 

Case 2. R 2 — R. Let 0 ^ a 6 R. Then aR is an ideal of R. Hence, either 
aR — {0} or aR = R. Suppose aR = {0}. Let T = (a). Then T is an ideal 
of R and a E T. Thus, T = R. Now aR = {0} implies that TR — {0} and 
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hence R 2 = {0}, which is a contradiction. Therefore, aR = R. Thus, for all 
0 / a 6 aR = R. We now show that R has no zero divisors. Let a, b be 
two nonzero elements of R such that ab = 0. Let T = {c £ R \ ac = 0}. 
It is easy to see that T is a nonzero ideal of R. Hence, by the hypothesis, 
T — R. This implies that R — aR — aT = {0}, a contradiction to the fact that 
R = R 2 ^ {0}. Consequently, R has no zero divisors. Next, for 0 / a G 
aR = R and so we find that ae — a for some e £ R. Since a / 0, we must have 
e 7^ 0. Also, since R has no zero divisors, a(e 2 — e) = 0 implies that e 2 — e. Now 
for any b £ R, eb = e 2 b implies that e(b — eb) = 0 and hence b — eb — be. This 
shows that e is the identity element of R. Also, aR = R implies that e = ab for 
some b £ R. Hence, a -1 exists in R. Consequently, R is a field. 

So from the above two cases we conclude that either R is the zero ring with 
a prime number of elements or R is a field. 


a b 
0 c 


11.2.2 Exercises 

1. Let T 2 (Z) = 
matrices over Z. 

(i) Prove that I = 
quotient ring T 2 (Z)/7. 

(ii) Prove that I = 
quotient ring T 2 (Z)/7. 


a, b, c £ Z > be the ring of all upper triangular 


0 b 
0 c 

0 a 

0 0 


6, c E Z > is an ideal of T 2 (Z). Find the 


a £ Z > is an ideal of T 2 (Z). Find the 


2. In the ring Z 2 4 , show that I = {[0], [8], [16]} is an ideal. Find all elements 
of the quotient ring Z 24 //. 

3. Show that the set / = {a + biy/5 [ a, b £ Z and a — b is even} is an ideal 
of the ring Z[iy/b\. 

4. Let R be a ring and a £ R. Show that aR is a right ideal of R and Ra is 
a left ideal of R. 


5. Let R be a ring. Let A be a left ideal of R and B be a right ideal of R. 

Show that AB is an ideal of R and BA C A fl B. 

6 . Let R be a ring such that R? 7 ^ {0}. Prove that R is a division ring if 
and only if R has no nontrivial left ideals. 

7. Let R be a ring with 1. Prove that R has no nontrivial left ideals if and 
only if R has no nontrivial right ideals. 

“Pi l*ic- “PlLySAjcS. 
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8 . 

9. 

10 . 

11 . 


12 . 

13. 


14. 

15. 


16. 


17. 


18. 

19. 


Let / 1 , I 2 be ideals of a ring R. Prove that I\ U I 2 is an ideal of R if and 
only if either I\ C I 2 or I 2 Q I\. 

Let / and J be ideals of a ring R. Prove that I + J is an ideal of R and 
that I + J = (I U J) , the ideal of R generated by I U J. 

Let I be an ideal of a commutative ring R and a € R. Prove that 
(I U {a}) — {i + ra + na \ i € /, r G R, n E Z}. 

Let m, and n be positive integers in Z. Prove that 

(i) (m, n) = (m) + (n) = (d) , where d is the greatest common divisor of 
m and n; 

(ii) (m) Pi (n) — (q) , where q is the least common multiple of ra and n. 
Find all ideals of the Cartesian product F\ x F 2 of two fields F\ and F 2 . 

Consider the Cartesian product ring R\ x R 2 of the rings R\ and R 2 . 

(i) If Ii is an ideal of R\ and I 2 is an ideal of R 2 , prove that I\ x I 2 is an 
ideal of R\ x R 2 . 

(ii) Suppose Ri and R 2 are with 1 and I is an ideal of R\ x R 2 . Does 
there exist ideals I\ of R\ and I 2 of R 2 such that I = Ij x RI 

Let R be an ideal of a ring R. Prove that the quotient ring R/I is a 
commutative ring if and only if ab — ba G I for all a, 6 € R. 

Let T = {§ | f € Q, a and b are relatively prime and 5 does not divide 6}. 
Show that T is a ring under the usual addition and multiplication. Also, 
prove that I = {| G T \ 5 divides a} is an ideal of T and the quotient 
ring T/I is a field. 

Let I be an ideal of a ring R. Prove that if R is a commutative ring with 
identity, then R/I is a commutative ring with identity. If R has no zero 
divisors, is the same necessarily true for R/I? 

Let I be an ideal of a commutative ring R. Define the annihilator of I 
to be the set 


ann I = {r € R \ ra = 0 for all a 6 I}. 

Prove that ann I is an ideal of R. 

In the ring Z 20 , prove that / = {[n] | n is even} is an ideal. Find ann I. 

In the ring Z[z], show that I = {a + bi \ a, b E Z and a, b are even} is an 
ideal. Find ann I. 

~Pu*uc- 
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20. In a commutative regular ring R with 1, prove that every principal ideal 
I is generated by an idempotent and for every principal ideal /, there 
exists a principal ideal J such that R = I + J and I n J = {0}. 

21. Prove that every ideal of a regular ring is regular. 

22. Prove that a ring R is regular if and only if every principal left ideal of 
R is generated by an idempotent. 

23. Prove that in a commutative regular ring with 1 every finitely generated 
ideal is a principal ideal. 

24. In a ring i?, prove that {0} is the only nilpotent ideal if and only if for 
all ideals A and B of R ) AB = {0} implies An B — {0}. 

25. Let R be a ring and / : R —► [0,1] be such that 

f{d-b) > min {/(a),/(&)}, 
f(rb) > f(b) 

for all a, b, r E R. Prove the following: 

(i) /(0) > /(a) for all a E R; 

(ii) /(a) = /(—a) for all a E R; 

(iii) for all t E T(/), Rt = {x E R \ f(x ) > f} is a left ideal of R ; 

(iv) Rq = {a E R | /(a) = /(0)} is a left ideal of R. 

26. Let R be a ring. A relation p on R is called a congruence relation on 
the ring R if p is an equivalence relation on R and for all o, b, c E R , apb 
implies that acp&c, capcb, and (a + c)p(6 + c). Let I be an ideal of R and 
p be the relation on R defined by apb if and only if a — b E /. Show that 
p is a congruence relation on R. 

27. In each of the following exercises, write the proof if the statement is true; 
otherwise, give a counterexample. 

(i) If {Ii | i E N} is a collection of ideals of R , then U^n-^ is an ideal of 
R. 

(ii) Z is a subring of R, but not an ideal of R. 

(iii) If I is a nontrivial ideal of an integral domain R, then the quotient 
ring R/I is an integral domain. 



11.3. HOMOMORPHISMS AND ISOMORPHISMS 


309 


11.3 Homomorphisms and Isomorphisms 

In this section, we introduce the ideas of homomorphisms and isomorphisms 
of rings. These concepts are the analogs of homomorphisms and isomorphisms 
for groups. 

Definition 11.3.1 Let ( R , +, •) and (R ', +', •') be rings and f a function from. 
R into R'. Then f is called a homomorphism of R into R' if 

f(a + b) — f(a) +' f(b), 

f(a ■ b) = f{a) ■' f(b) 

for all a,b 6 R. 

A homomorphism / of a ring R into a ring R' is called 

(i) a monomorphism if / is one-one, 

(ii) an epimorphism if / is onto R', and 

(iii) an isomorphism if / is one-one and maps R onto R!. 

If / is an isomorphism of a ring R onto a ring R! , then / _1 is an isomorphism 
of R f onto R. 

An isomorphism of a ring R Onto R is called an automorphism. 

Definition 11.3.2 Two rings R and R! are said, to be isomorphic if there 
exists an isomorphism, of R onto R!. 

We write R ~ R! when R and R! are isomorphic. 

When speaking of two rings R and R', from now on we usually use the 
operations + and ■ for both rings. Let f : R —* R' be a homomorphism of 
rings. Since / preserves +, / is a also a homomorphism of the groups (R, +) 
and (R' , +). Hence, we can immediately apply Theorem 5.1.2 to conclude that 
/ maps 0 to O', i.e.. /(0) = O', and for all a £ i?, —/(a) = /(—a). We list 
some properties of homomorphisms in the following theorem. The proofs are 
similar to the proof of Theorem 5.1.2 and so we leave them as an exercise for 
the reader. 


Theorem 11.3.3 Let f be a homomorphism, of a ring R into a ring R !. Then 
the following assertions hold. 

(i) /(0) = O', where 0' is the zero of R!. 

(ii) /(—a) = —/(a) for all a £ R. 

(iii) f(R) = {/(a) | a £ R} is a subring of R'. 

(iv) If R is commutative, then f(R) is commutative. 

Suppose R has an identity and f(R) - R!. Then 
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(v) R! has an identity, nam.ely, /(1). 

(vi) If a E R is a unit, then f(a ) is a unit in R' and 

/(a)" 1 = /(a- 1 ). ■ 

We point out that in (v) of Theorem 11.3.3, if / is not onto, then R' may or 
may not have an identity. Even if R! has an identity, the identity of R need not 
map onto the identity of R!. We illustrate this point later in Example 11.3.7. 

Definition 11.3.4 Let f be a homomorphism of a ring R into a ring R'. Then 
the kernel of f, written Ker f, is defined to be the set 

Ker f = {a e R \ /(a) = 0 / }. 

From Theorem 11.3.3, we know that 0 E Ker /. 


Example 11.3.5 The identity m,ap of a ring R is a homom.orph.ism. (in fact, 
a,n isomorphism ,). Its kernel is {0}. Let R and R' be rings and. f : R R! be 
defined by f(a) — O' for all a E R. Then f is a homomorphism, of R into R' 
and Ker f = R. 

Example 11.3.6 Let f be the m.apping from. Z onto Z n defined by f(a) = [a] 
for all a 6 Z. From, Example 5.1.4, /(a + b) = /(a) + n /(&) for all a,b E Z. 
Also, /(a • 6) = [ab] — [a] • n [6] = /(a) - n /(6) for all a,b E Z. Thus, f is a 
homomorphism, of Z onto Z n . As in Example 5.1.4, Ker f = {qn \ q E Z}. 

In the following example, we show that if / is a homomorphism from a ring 
R with 1 into a ring R' with 1 and / is not onto, then the identity of R need 
not map onto the identity of R !. 

Example 11.3.7 Consider the direct sum, Z©Z of Z with itself (see Exercise 
17, page 283). Define f : Z —> Z © Z by f(a ) = (a,0) for all a E Z. From, the 
defi.ni.tion of /, / is well defined. Now for all a,b E Z, f(a + b) = (a + b, 0) = 
(a, 0) + (b, 0) = /(a) + f(b) and f{ab) = (ab , 0) = (a, 0)(6,0) = /(a)(6). Thus, 
f is a homomorphism,. Also, Ker f = {0}. Now /(1) = (1,0). but (1,1) is the 
identity of Z © Z. Therefore, the identity of Z does not m,ap onto the identity 
of Z © Z. 


Consider the rings Z and Q. Suppose Z ~ Q. Then the groups (Z, +) and 
(Q, +) are isomorphic. However, this is not possible since (Z,+) is a cyclic 
group and (Q,+) is not a cyclic group. In the following example, we give 
another argument to show that Z is not isomorphic to Q. 
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Example 11.3.8 Suppose Z ~ Q. Let f : Z —► Q be an isomorphism. Then 
/(1) = 1 and /(0) = 0. Let n be a positive integer. Then f(n ) = /(I + • • ■ + 1) 

y ^ y 

n times 

— /(1) + /(1) + ••• + /(1) = n/(l) = nl = n. iVoro suppose that n is a 
negative integer. Let n = —m, where m is positive. Then f(n ) = /(— m) = 

/(-l - 1-1) - -/(l) - /(l)-/(l) - m(-f( 1)) = -m/(l) = 

—ml — —m — n. Hence, f(n) = n for all n E Z. Let 0 -=fi | E Q\Z. Smce / is 
onto Q, there exists n € Z such that | = /(n) = n, which is a contradiction. 
Hence, Q is not isom,orphic to Z. 

In the following example, we consider two rings which look similar, but 
which are not isomorphic. 

Example 11.3.9 In this example, we show that the ring Z[\/3] = {a 4- b\/ 3 | 
a, b E Z} a,nd, the ring Z[\/5] = {a+by/5 \ a, b E Z} are not isomorphic. Suppose 
there exists an isomorphism, f : Z[\/3] —> Z[\/5]- Now 3 = (0 + \/3) 2 . Thus, 
/(3) = 3) 2 ) = (/(%/3)) 2 . Since f is an isomorphism, we have /(1) = 

1. This implies that /(3) = 3. Hence, 3 = (/(\/3)) 2 - .Since /(%/3) E Z[\/5], 
f(y/3) — a + by/ 5 /or some a + 6\/5 E Z[\/5]- Therefore, 3 = (a + 6\/5) 2 and so 
3 = a 2 +56 2 +2a&\/5- If a b = 0, then 3 = a 2 +56 2 . But there do not exist integers 
a and b such that ab = 0 and 3 = a 2 + 5 b 2 . If ab ^ 0, then \/5 = 3 ~ - a 2 2 ffl ^ 5b2 - E Q, 
which is a contradiction. Hence, Z[\/3] and Z[\/5] are not isomorphic. 

The next example shows that the ring Z n and the ring Z/ (n) are isomor¬ 
phic. 

Example 11.3.10 Consider the ideal, (n) generated by a fixed, positive integer 
n E Z. By Corollary 11.2.9, (n) = {qn \ q E Z}. The cosets of (n) in Z are 
a + (n) = {a + grc | g E Z}- Now 

Zj (n) — {a + (n) | a E Z}. 

Define f : Z n —> Z/ (n) by /([a]) = a + (n) /or all [a] E Z n . ITe recall that 
f is an isomorphism, of ( Z n , + n ) onto (Z/ (n ), +) (Example 5.1.15). Now 
/([a] - n [6]) = /(H) = ab+(n) = (a + (n))(6 + (n)) = /([«])/([&]). Thus, f is 
a ring isomorphism, of Z n onto Zj (n) . 

Theorem 11.3.11 Let f be a homomorphism, of a ring R into a ring R'. Then 
Ker f is an ideal of R. 


Proof. Since 0 E Ker /, Ker / ^ <p. Let a, b E Ker /. Then f(a — 6) — 

/(a) — /(&) — O' — 0' = O' and so a — b E Ker /. Let r € R. Then /(ra) = 

/(r) • /(a) = /(r) • O' = O' and so ra E i?. Similarly, ar E Ker /. Hence, Ker / 

7feoiAe»tafL<ra/ “PlLy-Slc-A. 
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is an ideal of R. ■ 

In the remainder of the section, we consider isomorphism theorems which 
are parallel to those for groups (Section 5.2). 

Theorem 11.3.12 Let R be a ring and I be an ideal of R. Define the mnpping 
g : R —> R/I by g(a ) = a +1 for all a € R. Then g is a homomorphism, called 
the natural homomorphism, of R onto R/I. Furthermore, Ker g = I. 


Proof. Now for all a, b € R, g{a + 6) = (a + 6) + I = (a + I) + (b + I) — 
g{a) + g{b) and g(ab) — ab + I — (a + I)(b + I) — g(a)g(b). That Ker g ~ I 
follows from Theorem 5.1.12 in group theory. ■ 

Theorem 11.3.13 Let f be a homomorphism, of a ring R onto a ring R! and 
I be an ideal of R contained in Ker f. Let g be the natural homomorphism, of 
R onto R/I. Then there exists a unique homomorphism, h of R/I onto R! such 
that f = ho g. Furthermore, h, is one-one if and only if I = Ker f. 


Proof. Once again, we use the work already done for groups. Define 
h : R/I —> R' by h{a + I) = /(a) for all a £ R. We have the desired re¬ 
sults by Theorem 5.2.1, once we verify that h, preserves multiplication. Now 
h((a + I)(b + /)) = h(ab + I) = f(ab ) = f(a)f{b) = h(a + I)h(b + /). ■ 

The proof of the following theorem is similar to that of the first isomorphism 
theorem for groups. We omit the proof. This theorem is also known as the 

fundamental theorem of homomorphisms for rings. 

Theorem 11.3.14 (First Isomorphism Theorem) Let f be a homomor¬ 
phism of a ring R into a ring R!. Then f(R) is an ideal of R' and 

R/Ker f ~ f(R). ■ 

We state the following theorem without proof. Its proof is a direct transla¬ 
tion of the proof of the corresponding theorem for groups. 


Theorem 11.3.15 (Correspondence Theorem) Let f be a homomorphism, 
of a ring R onto a ring R'. Then f induces a, one-one inclusion preserving cor¬ 
respondence between the ideals of R containing Ker f and the ideals of R' in 
such a, way that if I is an ideal of R containing Ker /, then /(/) is the corre¬ 
sponding ideal of R ', and if I' is an ideal of R ', then is the corresponding 

ideal of R. ■ 


Pttte TfeafAewtafLca/ ~Ph.y-A.LG-A. 
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An example similar to Example 5.2.13 can be developed to illustrate The¬ 
orem 11.3.15 

The next two isomorphism theorems for rings correspond to Theorems 5.2.8 
and 5.2.6, respectively. 

Theorem 11.3.16 Let f be a homom,orphism, of a ring R onto a ring R ', I 
be an ideal of R such that I D Ker /, g , and g' be the natural hom.om.orphism.s 
of R onto R/I and R! onto R '//(/), respectively. Then there exists a unique 
isomorphism h of R/I onto R'/ f (I) such that g' o f = h o g. ■ 

Corollary 11.3.17 Let 1 1 , I 2 be ideals of a, ring R such that I\ C I 2 . Then 

c R/h)/(I 2 /Ii ) =s R/h■ ■ 

Theorem 11.3.18 If I and J are ideals of the ring R, then 1/(1 (1 J) ~ 
(I + J)/J. ■ 


11.3.1 Worked-Out Exercises 

<(> Exercise 1 Show that the function / : Zq —> Z 10 defined by /([a]) = 5[a] 
for all [a] G Zq is a ring homomorphism of Z§ into Zjq. 

Solution: We first show that / is well defined. Let [a] = [ b ] in Z6- Then 
a —6 is divisible by 6. Thus, a = 6/c+6 for some k G Z. Now 5a = 30k+5b shows 
that 5[a] = [5a] = [30fc+56] = [30fc]+ 10 [56] = [0]+io5[&] = 5[6] in Zio- Therefore, 
/([a]) = /([£>]). Thus, we find that / is well defined. Let [a], [6] G Zq. Then 
/(H +6 M) — /([a + b]) = 5[a + b] = 5([a] +10 [b]) = 5[a] +10 5[6] = /(a) +10 f(b) 
and /([a] *6 [b]) = f([ab\) = 5 [ab\ = 25 [ab] (since Z 10 is of characteristic 10) = 
(5[a]) no (5[6]) — /(a) -jo f(b). Hence, / is a homomorphism. 

0 Exercise 2 Let R be the field of real numbers. Let a be an automorphism 
of R. Show that a(x) = x for all x G R. 

Solution: Since a is an automorphism of R, o(0) = 0, and o;(l) = 1. 
Let n G N. Then a(n) = a(l + 1 + • • • + 1) = cc(l) + ck( 1) + • * ■ + a?(l) = 

1 4 - 1 + •••-(- 1 = n. Now let m G Z and m < 0. Let n = —m > 0. Then 

0 ( 777 *) = o(— 77 ) — — 0 ( 77 ) = —77 = 777 . This shows that a(x) = x for all x G Z. 

Let ^ G Q. Then o(^) = a(pq~ l ) — a(p)a(q~ l ) = pa(q)~ l — pq~ x = This 

shows that ot(x) = x for all x G Q. Let x G R be such that x > 0. Then x — y 2 
for some y G R. Thus, o(x) = a(y 2 ) = ce(yy) — a(y)a(y) = ot(y ) 2 > 0. Now 
let a, b G R be such that a > b. Then a — 6 > 0. Hence, o(a — 6 ) > 0 and so 
a(a) — a( 6 ) > 0, i.e., a(a) > 0 ( 6 ). Therefore, a is order preserving. We now 
show that a is continuous. Let e G R and e > 0. Since a is onto R, there exists 
8 > 0 such that o(<5) = e. Now let x, y G R be such that |x — y\ < 8 . Thus, 


—8 < x 


-y < 8 . 



11.3. HOMOMORPHISMS AND ISOMORPHISMS 


314 


Since a is order preserving, 

a(—S) < a(x — y) < a(<5). 


Therefore, 


—e < a(x — y) < e 


and so 


—e < a(x) — a(y) < e. 


This implies that 


\a(x) - a(y)\ < e. 

Hence, a is continuous. Now let x £ R. Since Q is dense in R, there exists a 
sequence {a n } of rational numbers such that 

lim a n = x. 

n—>oo 


Since a is continuous, 

a (a:) = a( lim a n ) = lim a(a n ) = lim a n = x , 

n—>oo n —►oo n —>oo 

proving the result. 

0 Exercise 3 Let R be a ring with 1. If the characteristic of R is 0, show that 
R contains a subring isomorphic to Z. 

Solution: Let T — {nl \ n E Z}. Since 0 = 01 G T, T ^ <fi. Let a = nl and 
b = ml be two elements of T. Then a — b — nl — ml = (n — m) 1 and ab = 
(nl)(ml) — (nm)l. Hence, a — b, ab E T. Thus, T is a subring of R. Suppose 
n,m are two integers such that nl = ml. If n > m, then (n — m)l = 0. This 
contradicts the assumption that R is of characteristic 0. Similarly, m > n also 
leads to a contradiction. Hence, n = m. Thus, we find that for each a E T, 
there exists a unique integer n such that a = nl. Hence, the mapping / : Z —> T 
defined by /(n) = nl is an isomorphism. 

Exercise 4 Let p be a prime integer. Show that there are only two noniso¬ 
morphic rings of p elements. 


Solution: It is known that (Z p ,+ p ) is the only group of order p (up to 
isomorphism). Define ©i and ©2 on Z p by [a] ©1 [6] = [0] and [a] ©2 [6] = [ab] 
for all [a], [b] 6 Z p . Now Qi and ©2 are well defined and (Z p , + p ,©i) and 
(Z p , + p , © 2 ) are rings. Let R be a ring with p elements. Then (R, +) ~ 
(Z P ,T P ). If R 9^ (Z p ,T p ,0i), then the multiplication of R is not ©]. Let [a] 
be a generator of (Z p , + p ). Now [a] 2 = n[a} for some nonzero integer n. There 


exists an integer m such that m,n = p 1 . Let [6] = m[a]. Then [b] 2 = m 2 [aP = 
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m 2 n[a\ = m[a\ = [b\. Let g be an isomorphism from (Z p , + p ) onto (i?, +). Define 
/ : Z p -»■ Rby f([u]) = ug([b\) for all [u] G Z p .Then f([u]+ p [v]) = f([u + v )) = 
(u + v)p([6]) = ug([b\) + vg([b\) = f([u\) + f([v]) and f([u] © 2 [v]) = f([uv j) = 
(uv)g([b}) = uvg([b ] 2 ) = uvg([b])g([b]) = ug([b])vg([b]) = f([u])f( H). Hence, 
/ is a ring homomorphism. Let c E R. Then there exists [u] G Z p such that 
^(M) — c - Now [ii] = t[a] for some t G Z. Thus, f([tn ]) = tn^([ 6 ]) = tn 
p(m[a]) = tp(mn[aj) = t^([a]) = g(t[a\) = ^([it]) = c. Hence, / is onto R. Since 
jZp| = \R \, it follows that / is one-one. Thus, / is an isomorphism. 


11.3.2 Exercises 

1. Let R denote the set of all 2 x 2 matrices of the form 


, where a 


a b 
—b a 

and b are real numbers. Prove that R is a ring and the function a T bi 
a b 


—b a 


is an isomorphism of C onto R. 


2. Define the binary operations © and ©onZbya ©6 = a + 5— 1 and 
a ©6 = a + b — ab for all a, b G Z. Show that (Z, 0 , ©) is a r'ng isomorphic 
to the ring (Z,+, •). 


3. (i) Show that the rings R and Q are not isomorphic. 

(ii) Show that the rings R and C are not isomorphic. 

(iii) Are the rings Zq and Z 3 x Z 2 isomorphic? 


4. Let T 2 (Z) - 


a b 
0 c 


a, b, c G Z > be the ring of all upper triangular 


matrices over Z. Define / : T 2 (Z) —► Z by for all 


a b 
0 c 


€ T 2 (Z), 


/( 


a 

0 


b 

c 



a. 


(i) Show that / is a homomorphism. 

(ii) Is / an epimorphism? 

(iii) Is / an isomorphism? 

(iv) Find Ker /. 

5. Does there exist an epimorphism from the ring Z 2 4 onto the ring Z 7 ? 

6 . Show that there does not exist a monomorphism from the ring Zg into 
the ring Z\\. 

“Pufuc. 7^a^Lc#fto£uiaZ 
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7. Show that the ring 2Z is not isomorphic to the ring 3Z. 

8. Let R be a Boolean ring. If {0} and R are the only ideals of R, prove 
that R ~ Z 2 - 

9. Show that the ring Z is not isomorphic to any proper subring of Z. 

10. Is the ring Q[\/2] isomorphic to the ring Q[\/3]? 

11. Let / : R —► S be a nontrivial homomorphism from a field R onto a ring 
S. Prove that S' is a field. 

12. Let R be a ring with 1. If R is of characteristic n > 0, show that R 
contains a subring isomorphic to the ring Z n . 

13. Show that there exist only two homomorphisms from R into R. 

14. Prove that every ring R is isomorphic to a subring of M n (R ), the ring of 
n x n matrices over R. 

15. Let / be a homomorphism of a ring R onto a ring R!. Prove that 

(i) if I is an ideal of R, then /(/) is an ideal of R'; 

(ii) if I' is an ideal of R', then f~ l (I') is an ideal of R and / -1 (J / ) D Ker 

/; 

(iii) if R is commutative and I and J are two ideals of R , then f(I + J) — 
f(I) + f(J) and /(/J) = /(/)/( J). 

16. In each of the following exercises, write the proof if the statement is true; 
otherwise, give a counterexample. 

(i) There exist only two homomorphisms from the ring of integers into 
itself. 

(ii) The mapping / : Z —> Z defined by f(n) = 3n is a group homomor¬ 
phism, but not a ring homomorphism. 

(iii) The only isomorphism of a ring R onto itself is the identity mapping 
of R. 

(iv) Let R be a ring with 1. Let f : R S be a ring homomorphism. 
Then /(1) is the identity element of S. 

(v) A nonzero homomorphism from a field into a ring with more than 
one element is a monomorphism. 

(vi) Every nontrivial homomorphic image of an integral domain is an 
integral domain. 


“Pi lHjc. 
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Richard Dedekind (1831-1916) was 
born on October 6, 1831, in Brunswick, Ger¬ 
many, the birthplace of Gauss. He was the 
youngest of four children. 

In 1848, Dedekind went to Collegium 
Carolinum, an institution attended by Gauss, 
where he became a master in analytic ge¬ 
ometry, algebraic analysis, differential and 
integral calculus, and higher mechanics. In 
1849-1850, he gave private lessons in math¬ 
ematics. He matriculated, in 1850, at the 
University of Gottingen. 

After four semesters, in 1852 Dedekind 
completed his Ph.D. work under Gauss. His thesis was on the elements of the theory 
of Eulerian integrals. Later he determined that his knowledge in some areas of math¬ 
ematics was lacking for advanced study at Gottingen. He then spent the next two 
years, following his graduation, filling the gaps in his education. 

Dedekind started his teaching career in 1854. In 1855, Dirichlet succeeded Gauss in 
Gottingen. Dedekind attended his lectures on various areas of mathematics, including 
the theory of numbers, and became a close friend of Dirichlet. In 1855-1856, he also 
attended Riemann’s lectures on Abelian and elliptic functions. Thus, along with being 
an instructor, he was also a student. 

Dedekind was the first university teacher to lecture on Galois theory. He introduced 
the concept of a field, replaced the concept of a permutation group by the abstract 
group concept, and, in 1858, introduced a purely arithmetic definition of continuity. 

Dedekind is most remembered for his concept of “Dedekind cut,” which he intro¬ 
duced in 1872. He was criticized on this theory by mathematicians such as Kronecker, 
Weiestrass, and Russell. 

Dedekind edited the works of Gauss, Dirichlet, and Riemann. In 1871, he sup¬ 
plemented Dirichlet’s lectures, introducing the notion of an “ideal,” a term he coined. 
Later he developed the theory of ideals. He is also credited for such fundamental 
concepts as ring and unit. His treatises on number fields stimulated further develop¬ 
ment of ideal theory. Dedekind also extended Rummer’s work on unique factorization 
domains. His work on abstract algebra influenced Emmy Noether’s work on algebra. 

Dedekind died on February 12, 1916. 






Chapter 12 

Ring Embeddings 


12.1 Embedding of Rings 

Sometimes it is worthwhile to study the properties of a ring by considering it 
as a subring of some ring with more ring properties than itself. A ring without 
identity lacks important arithmetic properties, in particular, a fundamental 
theorem of arithmetic. As another example, in the ring E of even integers, we 
cannot say that 2 divides 2 since 1 ^ E. Now E is a subring of Z and 1 € Z. In 
Z, it is true that 2 divides 2. The main aim of this section is to embed a ring 
into a suitable ring with additional properties. The main feature of this section 
is that any integral domain can be embedded in a field. The proof of this result 
yields a rigorous construction of the rational numbers from the integers. 

Definition 12.1.1 A ring R is said to be embedded in a ring S if there exists 
a m.onom.orphism of R into S. 

From the above definition, it follows that a ring R can be embedded in a 
ring S if there exists a subring T of S such that Rc=lT. 

In the next theorem, we show that any ring R can be embedded in a ring 
with identity. 

Theorem 12.1.2 Any ring R can be embedded in a ring S with 1 such that R 
is an ideal of S. If R is commutative, then S is commutative. 

Proof. Set S — R x Z. Define addition and multiplication as follows: 

(a, m) + (6, n) = (a + b, m + n), 

(a, m) • (b, n) = (ab + na + mb, mn ) 

for all a, b E R and m,n € Z. (Here na means a adds to itself n times if n is 
positive, —a adds to itself |n| times if n is negative, and Oa = 0.) Then S forms 
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a ring under these definitions of addition and multiplication, a fact we ask the 
reader to prove in the exercises. We do note that (0,0) is the additive identity 
and that (0,1) is the multiplicative identity of S. 

Consider the subset R x {0} of S. Since (0,0) € R x {0}, R x {0} ^ 4>. 
Also, for all (a, 0), (6, 0) E R x {0}, (a, 0) — (6,0) = (a — 6, 0) € R x {0}, and 
(a, 0) • (6,0) = (a6,, 0) E R x {0}. Thus, R x {0} is a subring of S. Now for all 
(a, 0) E R x {0} and (c, n) 6 5, (a, 0) • (c,n) = (ac + na, 0) E R x {0} and 
(c, n) ■ (a, 0) = (ca + na, 0) E 1? x {0}. This proves that R x {0} is an ideal of 
S. 

Now define / : R —* R x {0} by /(a) = (a, 0) for all a E R. Then 
/ is an isomorphism of R onto R x {0} and so ~ i? x {0}. Therefore, 
R can be embedded in S. By identifying a E R with (a, 0) E R x {0}, 
we can regard R to be an ideal of S. To show that the commutativity of 
R implies that of 5, let (a,m),(6,n) E S and R be commutative. Then 
(a, m) • (6, n) = (ab + na + mb, mn) = (6a + mb + na, nm) (since R is commu¬ 
tative, ab — ba) = (6,n) • (a, m). Thus, S' is commutative. ■ 

Our main objective in this section is to embed a ring in a field. By Theorem 
12.1.2, every ring can be embedded in a ring with identity. If S were a field, 
then S is commutative and has no zero divisors. This in turn implies that R 
is commutative and has no zero divisors. Thus, if we were to embed a ring 
R in a field S, then R must have at least these two properties, i.e., R must 
be commutative and have no zero divisors. In the next theorem, we embed a 
commutative ring with no zero divisors into an integral domain and then we 
will embed an integral domain in a field. 

Theorem 12.1.3 Let R be a commutative ring with no zero divisors. Then R 
can be embedded in an integral dom.ain. 


Proof. Let S be the ring as defined in Theorem 12.1.2. Let A be the an- 
nihilator of R in S. Then A is an ideal of S by Exercise 17 (page 307). If 
Rfl A — {0}, then the natural homomorphism of R onto the quotient ring 
S/A must map R one-one into S/A , i.e., R can be embedded in S/A. We now 
show that R n A — {0} and that S/A is an integral domain. Let a E R fl A. 
Then ar = 0 for all r E R. Since R has no zero divisors, a = 0. Therefore, 
R H A — {0}. Let 6 + A, c -r A E S / A. If (6 + A) ( c T A) — 0 T A, then be E A. 
Thus, (6c)r — 0 for all r E R. Suppose c +A / 0 +A, i.e., c (/ A. Then there 
exists r E R such that cr ^ 0. Since R is an ideal of S, cr E R, and for all 
s E R, 6s E R. Now (cr)(bs) = (bcr)s = 0s = 0. Also, R has no zero divisors 
and cr 0. Therefore, we must have bs = 0. This implies that 6 E A and so 
6 + A = 0 + A. Hence, S/A is an integral domain. ■ 


“Public. 
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Suppose we are given the ring of integers Z and we are asked to construct 
the rational numbers from Z. We can think of any integer as n/1, i.e., n divided 
by 1. However, we must somehow pick up the fractions which cannot be reduced 
to having a 1 for a denominator. One idea that suggests itself is to consider 
the Cartesian product Z x Z and consider the first component of the elements 
of Z x Z as the numerator and the second component as the denominator. 
However, the ordered pairs (3,2) and (6,4) are distinct. A common technique 
used in mathematics suggests putting these elements in the same equivalence 
class so that they become “equal.” This is precisely what we shall do. Let’s 
also remember not to have 0 in the denominator. 

Theorem 12.1.4 Any integral domain R can be embedded in a fi,eld,. 

Proof. Let S = R x (i?\{0}). Define the relation ~ on S by for all 
(a, 6), (c, d) G S, (a, 6) ~ (c, d) if and only if ad = be. Then ~ is an equivalence 
relation. The reflexive and symmetric properties are immediate. Suppose that 
(a, b) ~ (c, d) and (c, d) ~ (e, /). Then ad = be and cf = de. This implies that 
adf = bef and bef = bde and so adf — bde. Canceling d , we obtain af = be , 
i.e., (a, b ) ~ (e, /). Hence, ~ is transitive. Now ~ partitions S into equivalence 
classes. Denote the equivalence class {(c, d) E S | (c, d) ~ (a, 6)} by a/b. Set 

F = {a/b | (a, b) G S}. 

Define + and • on F as follows: 

a/b + c/d = (ad + bc)/bd, 
a/b-c/d = ac/bd 

for all a/6, c/d G F. We show that + is well defined. Let a/6, c/ d,a'/b',c'/d' G 
F. Suppose a/b — a'/b' and c/d — c'/d'. Then ab' = ba' and cd! = dc f . Therefore, 
ab'dd' — ba'dd' and cd'bb' = dc'bb'. Hence, 

ab'dd! + cd'bb' — ba'dd' + dc'bb ', ; 


and so 


Thus, 


and so 


(ad + bc)b'd! = bd(a'd' + 6V). 
(ad + be, bd) ~ (a'd' + b'c', b'd ') 
(ad + be)/bd = (a'd' + b'c')/b'd'. 


A similar proof shows that * is well defined. 
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The reader is asked to verify the associative, commutative, and distributive 
laws for F. The additive identity of F is 0/6 and the multiplicative identity of 
F is 6 / 6 , where 6^0. For a /6 E F , the additive inverse is 

(—a)/6 - a/ (—6) 

and the.multiplicative inverse is 6 /a (when a ^ 0). Thus, F is a field. 

We now show that R can be embedded in F. Let 

R! = {a/1 | a E C F. 

Then R' is a subring of F. Define / : R —» R' by /(a) = a/1 for all a E R. 
Then a — 6 if and only if a • 1 = 1 • 6 if and only if a/1 = 6/1 if and only if 
/(a) = /( 6 ). Hence, / is a one-one function. Now 

f(a + 6 ) = (a + 6 )/l = (a • 1 + 1 ■ 6)/1 • 1 = a /1 + 6/1 = /(a) + /( 6 ) 

and 

f(ab) = ab/l — ab /1 • 1 = a /1 • 6/1 — /(a) • /( 6 ). 

From the definition of /, / is onto R!. Thus, / is an isomorphism of R onto 
R' CF.M 

The above theorem gives another instance of the power of the concept of 
an equivalence relation. We have once again used the notion of an ordered pair 
in a fundamental manner. 

Definition 12.1.5 Let R be an integral domain. A field F is called, a quotient 
field of R or a field of quotients of R if there exists a subring R\ of F such 
that 

(i) R ~ R\ and 

(ii) for all xEf, there exists a, 6 E R\ with 6 7 ^ 0 such that x — ab~ 1 . 

Let us now show that for the given integral domain R, the field constructed 
in Theorem 12.1.4 is a quotient field of R. Let x E F. Then x = a/ 6 , where 
(a, 6 ) E S. Now (a, 1) E S and (6,1) E S. Thus, a/1, 6/1 E R' and a /6 = 
a /1 • 1/6 = (a/1) • (6/1) -1 . Hence, F is a quotient field of R. We call F the 

quotient field or the field of quotients or R. 

Theorem 12.1.6 Let R be an integral dom.a,in and. F its field, of quotients. Let 
R! be an. integral d,om.ain contained in a fi.eld K' and set 

F' = {a'( 6 / ) _1 j a', b' E R', b' 7 ^ 0}. 

Then F' is the smallest subfield of K' which contains R' and, any isomorphism, 
of R onto R' has a unique extension to an. isomorphism, of F onto F 1 . 

~Pu*uc- 
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Proof. By Exercise 2 (page 323), F' is the smallest subfield of K' which 
contains R'. Let / be an isomorphism of R onto R!. Let a /6 E F. If /(a) = a ' 
and /(6) = b r , define g : F —> F' by 

g{a/b) = a!(b')~ l = /(a)/( 6 ) _1 . 


Identifying the ring R with the set {a/1 | a E R}, it is clear that / = g\ R . 
Now a/b — c/d if and only if ad = be if and only if f(ad) = /( 6 c) if and only 
if f(a)f(d) = /( 6 )/(c) if and only if /(a)/( 6) _1 = /(c)/(d ) _1 if and only if 
g{a/b) = g{c/d). Therefore, g is a one-one function. From the definition of p, 
it follows that g is onto F f . Now 


g(a/b + c/d) 


g((ad + be)/bd) 

f(ad + bc){f {bd))~ l 

[f{a)f{d) + f (b) f (c) }[f (b)~ l f (d)~ x ] 

/(a)/( 6 )- 1 +/(c)/(d )- 1 

g(a/b) +g{c/d) 


and 


g{a/b ■ c/d) 


g(ac/bd) 
f(ac)(f(bd ))~ 1 

[/(«)/(c)][/(6)- 1 /(d)^ 1 ) 

/w/rvw/w - 1 

g(a/b)g(c/d) 


for all a/ 6 , c/d E F. Thus, g is an isomorphism of F onto Fb 
Let p' be any other isomorphism of F onto F' such that / 


g'\ R . Then 


g'(a/b) = p/a /1 • ( 6 / 1 )- 1 ) 

= 5'(a/l)p/(6/l)-i) 

= P , (a/ 1 )p / ( 6 /l )- 1 

= f( a )f( b ) 1 

= ^(a/6) 

for all a/b E F and so g' — g. Thus, there is a unique extension of /. ■ 


We can conclude from this result that the field of quotients F of an integral 
domain R is “the” smallest field containing R in the sense that there does not 
exist a field K such that R C K C F. 

The field F' in Theorem 12.1.6 is called the quotient field of R' in K. In 
view of Theorem 12.1.6 and the comments preceding it, we do not differentiate 
between the notation a/b and a 6 -1 for the elements of F. 


~Pufuc- 
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12.1.1 Worked-Out Exercises 

<0 Exercise 1 Let D = {| € Q | 5 does not divide 6}. Show that D is a 
subring of Q with 1. Find the quotient field of D. 

Solution: Let a/6, c/d E D. Since 5 does not divide 6 and 5 does not divide 
d, 5 does not divide bd. Thus, (ad — bc)/bd E D and ac/bd E D. Hence, D is a 
subring of Q. Also, 1 = 1/1 E D. Since ZCDCQ and Q is the quotient field 
of Z, Q is the quotient field of D. 

Exercise 2 Let S be a ring and / a one-one func + on of S onto a set T. Show 
that suitable addition and multiplication can be defined on T so that T 
becomes a ring isomorphic to S under /. 

Solution: Define binary operations + and • on T as follows: Let £i ,£2 € T. 
Since / maps S onto T, there exist si,S 2 E S such that f(s 1 ) = t\ and /(S 2 ) = 
£ 2 . Define 

£1 T £2 = f(si + 82 ) and 
*1 • <2 = flsis 2 ). 

First we show that both these binary operations are well defined. Let £1, £2, £3, £4 
E T be such that £1 = £3 and t 2 = £4. Since / maps S onto T, there exist 
S1,S 2 ,S3,S 4 E S such that f(s{) = £1, f(s 2 ) = £ 2 , /(s 3 ) = h, and /(s 4 ) = £4. 
Therefore, f(s\) = f(s 3 ) and f(s 2 ) = /(s 4 ). Since / is one-one, = S3 and 
s 2 — s 4 . Hence, £i + £2 = f{s\ +s 2 ) = /(s3 + s 4 ) = £ 3 + £ 4 and £1 -t 2 = f(s\s 2 ) = 
/(^354) = £3 -£ 4 . Thus, + and • are well defined. It is now a routine verification 
to show that (T, + , •) is a ring. We verify some of the properties and leave others 
as an exercise. First we show that + is associative. Now t 2 + £3 = f(s 2 +S3) 
and £1 +£ 2 = /(«i + s 2 ). Thus, £i + (£ 2 +£3) = f{si + (s 2 + s 3 )) = f((si +s 2 ) +s 3 ) 
(since + is associative for S) = (t\ 4- 1 2 ) + £ 3 . Hence, + is associative for T. 
Also, /( 0 ) + £1 = /(0 + si) = f(si) = f(s\ + 0 ) = £1 4 - /(0). This implies that 
/(0) is the additive identity. Similarly, we can verify the other properties of a 
ring. It is immediate that / is a homomorphism and since / is one-one and / 
maps S onto T, S is isomorphic to T. 


12.1.2 Exercises 

1 . Prove the associative, commutative, and distributive laws in Theorem 
12.1.4. 

2. Let R be an integral domain, which is a subring of a field F. Let F' = 
{ab~ l | a, 6 E /£, 6 7 ^ 0}. Show that F' is a subfield of F. Furthermore, 
show that F' is the smallest subfield of F which contains R. 

~Pu*uc- 7fea£Ae»tait<ra/ 
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3. Let R and R! be integral domains contained in fields. Set F — {ab~ l \ 
a, b E R, b / 0} and F' = {a'b'~ l \ a', b' E R !, b' / 0'}. Suppose / is 
an isomorphism of onto R'. Prove that / has a unique extension to an 
isomorphism of F onto F'. 

4. Prove that any field R is equal to its field of quotients F in the sense that 
f(R) — F, where f is the isomorphism defined in Theorem 12.1.4. 

5. Prove that isomorphic integral domains have isomorphic fields of quo¬ 
tients. 

6 . Find the field of quotients of the integral domains Z[i] and Z[\/2]- 

7. Let R be a ring of characteristic n > 0 and 

R x Z n = {(r, [m]) | r E R and [m\ E Z n }. 

Define + and • on R x Z n by 

(a, H) + (6, [t\) = (a + b ) [vn + f]), 

(a, H) • (b, jfj) = {ab,[mt}) 

for all a,b e R, [m], [f] E Z n . Prove that 

(i) the above two operations are well defined, 

(ii) (R x Z n , +, •) is a ring with 1, 

(iii) (R x Z n , +, •) is of characteristic n, 

(iv) there exists a monomorphism from R into (R x Z n , +, •). 

8 . Let S and R' be disjoint rings with the property that S contains a subring 
S' such that there is an isomorphism f of S' onto R'. Prove that there 
is a ring R containing R' and an isomorphism f of S onto R such that 

f = fls 1 - 


~PuFlc. 



12.1. EMBEDDING OF RINGS 


David Hilbert (1862-1943) was born 
on January 23, 1862, in Konigsberg, Ger¬ 
many. Hilbert’s inclination toward mathe¬ 
matics is believed to be due to his mother. 

He attended the University of Konigsberg 
from 1880 to 1884, and received his Ph.D. 
in 1885. 

Heinrich Weber, Richard Dedekind’s col¬ 
laborator on the theory of algebraic func¬ 
tions, was a professor at the University of 
Konigsberg while Hilbert was a student. In 
1883, after Weber left, Lindeman was ap¬ 
pointed as his successor. Lindeman’s influ¬ 
ence caused Hilbert to become interested in 
the theory of invariants. 

Hilbert proved the famous Hilbert basis theorem—that is, if every ideal in a ring R 
has a finite basis, then so does every ideal in the polynomial ring R[x]. Hilbert’s results 
connected the theory of invariants to the fields of algebraic functions and algebraic 
varieties. He also proved the Hilbert irreducibility theorem. 

Hilbert also worked on algebraic number theory. This work centers on the reci¬ 
procity law, developed from Gauss’s law of quadratic residues. 

In 1893, Hilbert, along with Minkowski, was assigned to prepare a report on num¬ 
ber theory. Minkowsky soon withdrew from this project. Hilbert summarized the 
known results in Zahlbericht. For half a century, it was a bible for anyone interested 
in learning algebraic number theory. In 1899, Hilbert published Grundlagen der ge¬ 
ometric, which went into its ninth edition in 1962. After 63 years, the book was still 
being read, although it was slowly modernized. 

In 1900, while addressing the International Congress of Mathematicians on mathe¬ 
matical problems, Hilbert introduced 23 problems. These have since stimulated math¬ 
ematical investigations. 

Dirichlet’s principle, which was used in boundary value problems, had been dis¬ 
credited by Weierstrass’s criticism. Hilbert salvaged Dirichlet’s principle by proving it 
in 1904. 

Hilbert worked on algebraic forms, algebraic number theory, foundations of geom¬ 
etry, analysis, and theoretical physics. Many of his students became famous mathe¬ 
maticians, including Herman Weyl. Hilbert died on February 14, 1943. 






Chapter 13 

Direct Sum of Rings 


In this chapter, we construct some new rings from a given family {Ri \ i 6 1} of 
rings. For this purpose, we introduce the complete direct sum, the direct sum, 
and the subdirect sum of this family. The results developed in this chapter 
also help us to obtain structure results of rings. 

13.1 Complete Direct Sum and Direct Sum 

Let {Ri | % 6 /} be a family of rings indexed by a nonempty set I . The Cartesian 
product | i G 1} of the sets Ri is the set of all functions / : I —> U{I?i | 

i G 1} such that f(i ) € Ri for all i £ I. Let /, g £ II{i^ ] i £ /}. Define / + <?, 
fg by 

(/ + S )(0 = f(i)+g(i) 

CfaXO = 

for all i £ I. Then / + <?,/<? 6 n{i^ | i £ I}. It can be easily verified that 
II{^ | i G 1} together with the above two operations is a ring. This ring is 

called the complete direct sum of the family of rings {Ri \ i G 1} and is 

denoted by Il^/i^. The zero element of Ui^jRi is the function 0 : / —» U {Ri 
| i G 1} defined by 0(z) = 0*, the zero element of Ri, for alH G I. The additive 
inverse of / G U^jRi is the function —/ : I —> U{/^ | i G 1} defined by 
(—/)(*) — — f{i) 6 Ri for all i G /. Let / G II i^jRi and let f(i ) = G Ri 
for all i G I. Usually / is identified with the image set {a* | i G I}. Using this 
notation, the above two operations can be defined by 

{ui | i G /} + {bi | i G /} = {&i + bi \ i G /} 

{<2i | i G /} ■ {bi | i G /} = {&ibi | i G /} 

for all a*, bi G Ri for all i £ I. 

Suppose now that / is a finite set, say, I = {1,2,... , n}. In this case, the 
complete direct sum is denoted by (Bi^jRi — R\ © R 2 ® • • ■ © R n and an element 
{a* | i G 1} is usually written as an n-tuple (ai, < 22 ,..., a n ) . 
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Definition 13.1.1 The direct sum of a family of rings {Ri \ i £ /}, denoted 
by QAeiPi, is the set 

®izlRi = {{© | i £ 1 } £ Tli<=iRi | ai fi- 0 for at m.ost finitely many i £ I}. 

Theorem 13.1.2 Let {Ri \ i £ 1} be a family of rings. Then 

(i) is a subring of the complete direct sum of rings II i<=.iRi\ 

(ii) for all k £ I, the function ik : Rk (B^jRi defined, by 

ik(a ) = {{a.j | i £ 1 } | ai = 0 for all i ^ k and, a = a} 

for all a £ Rk, is a m,onomorphism, of rings; 

(Hi) for all k £ I, ik(Rk) is an ideal o/ffiie/i?*. 

Proof. (i) Let {a; | i £ 1} and {5* | i £ 1} be two elements of ©i € /i^. Since 
ai 7 ^ 0 for at most finitely many i £ I and bi / 0 for at most finitely many 
i £ /, it follows that ai — bi 7 ^ 0 for at most finitely many i £ I and 0 

for at most finitely many i £ I. Hence, {a* \ i £ 1} — {bi \ i £ 1} £ and 

{ai | i £ I}{bi | i £ 1} £ Thus, (Bi € jRi is a subring. 

(ii) Let a,b £ Rk- Then ik(a + 6 ) = {{a* | i £ /} | a z — 0 for allz 7 ^ k and 
ak — a + b} — {{a' j i £ /} | a' k — 0 for all i ^ k and a[ = a}+ {{£>■ | i £ 1} \ 
b'i — 0 for all i k and b' k = 6 } = fjt(a) + ik(b). Similarly, ik(ab) = ik{a)ik{b). 
Thus, ik is a homomorphism. By the definition of ik, we find that ik is one-one. 
Hence, ik is a monomorphism. 

(iii) Since ik is a monomorphism, ik(Rk) is a subring of Let {6* | 

i £ 1} £ and {ai | i £ 1} £ ik{Rk)- Since ai — 0 for all i ^ k, biai = 0 

for all i 7 ^ k. Also, for i — k, bk,ak £ Rk ■ Therefore, £ Rk - Thus, {bi \ 
i £ I}{ai | i £ /} £ ik{Rk ), proving that ik(Rk ) is a left ideal. Similarly, {ai \ 
i £ I}{bi | i £ /} E ik(Rk)- Hence, ik{Rk ) is an ideal. ■ 

By Theorem 13.1.2, we find that Rk is isomorphic to the subring ik(Rk) of 
©ig/iiT Identifying Rk with ik{Rk)> we can say that contains Rk as an 

ideal. 

Let I = {1, 2, ...,ra} and {Ri \ i £ /} be a finite family of rings. From the 
definition of direct sum, it follows that the complete direct sum and the direct 
sum of this family is the same. Hence, by Theorem 13.1.2, we can say that the 
direct sum, Ri © R 2 © • • • © Rn > contains each of R\, R 2 , ..., Rn as an ideal. 

We now investigate the conditions under which a ring R is isomorphic to a 
direct sum of a family of ideals (considering each ideal as a ring) of R. 

Definition 13.1.3 Let I be a finite nonempty set, say, {1, 2, ..., n}, and {Ai 
| i £ 1} be a family of ideals of a, ring R. Then the sum, of this finite family, 
denoted by Jfi<=j Ai, is the set 

^ = { a i + &2 + ■ * ‘ + | ai £ Ai, i = 1 , 2 , ..., n}. 
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If I is empty, then let us take Y)iel Ai = {0}. 

If I = {1, 2, ..., n}, then we also use the notation A\ + A 2 + • • ■ + A n to 
denote the sum YliGi Ai- 

We leave the proof of the following theorem as an exercise. 

Theorem 13.1.4 Let {Ai \ i £ 1} be a finite family of ideals of a ring R. Then 

(i) Yliti Ai is an ideal of R , 

(ii) Ai C X^ e/ Aj for all i £ I, 

(in) if A is an ideal of R such that Ai C A for all i £ I, then Y^izi Ai C A. M 

Definition 13.1.5 Let {Ai \ i £ /} be a family of ideals of a ring R , where I 
is finite or infinite. Then the sum, of this family, denoted by Y)iGi Ai, is the set 

'y ) Ai = {a £ R | a £ A l for som,e finite subset Iq of I}. 
iGI i£I 0 

Theorem 13.1.6 Let {Ai \ i £ 1} be a family of ideals of a ring R. Then 
Ylizi Ai is an ideal of R. 

Proof. Since 0 £ Y)i£i Ai, A% 7 ^ <f>. Let a,b £ Y)i£i Ai and r £ R. 
Then a £ YliGh Ai and b £ Y)iei 2 A% for some finite subsets I\ and I 2 of I. Let 
I 3 = I\ U D- Then a, b £ YliGh Ai- By Theorem 13.1.4, Y^ieh Ai is an ideal 
of R. Hence, a — b, ar , ra £ Y)iGl 3 Ai- Thus, a — b, ar, ra £ Y)iGi Ai and so 
YliGi Ai is an ideal of R. ■ 

Definition 13.1.7 Let {Ai \ i £ 1} be a finite family of ideals of a ring R. A 
sum, Y^iGi Ai of {Ai j i £ 1} is called a direct sum if for all k £ I, 

A k n ^ = {<>}• 

i€l, i^k 

Lemma 13.1.8 Let {Ai \ i £ 1} be a finite family of ideals of a ring R. If 
YjiGi Ai is a direct sum, then for all a £ Ak , b £ Ai, k l, ab = 0. 

Proof. Let a £ Ak, b £ Ai, and kfi^l. Since Ak and Ai are ideals, ab £ Ak 
and ab £ Ai. Since Ai C Yliei, i^k Ai, ab £ Y)i£i,i^kAi- Therefore, ab £ 
Ak n T,iGl,i^k A i- Since £*eJ A is a direct sum, A k fl Lie/, = {0}* 

Hence, ab = 0. ■ 

Theorem 13.1.9 Let {A l \ i £ 1} be a family of ideals of a ring R, I = 
{1, 2,... ,n}. Then the following conditions are equivalent. 

(i) Y)iGi Ai is a, direct sum,. 

(ii) a\ + <22 + ■ • • + a n = 0, ai £ Ai , i £ /, implies that ai = 0 for all i £ I. 
(Hi) Each element a £ Y,i£i Ai is uniquely expressible in the form, 

a = ai + a.2 T • *■ • + a n , 

where ai £ Ai, i £ I. 

“PilHjc. TfeoiAewtatLca/ 
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Proof. (i)=>-(ii) Let a\ -b a<i + • ■ • 4- a n = 0, < 2 ^ £ Ai, i £ I. Let k £ I. Now 

— O-k — a l + a 2 + • • * + a^-l + a k +1 + • ■ • + CLn £ A fa fl Ai — {0}. 

iG/, i^k 

Hence, = 0. 

(ii) =^(iii) Let a — ai + a 2 + • • • + a n = bi + 62 + • • • + 6 n > where a z , b z £ A* 
for all i £ I. Then (ai — 61 ) + (<22 — 62 ) + - • • + (a n — b n ) — 0. Hence, by (ii), 
a i ~ h — 0 for all i G I, i.e., a z = bi for all i G I. 

(iii) =>(i) Let a G A^flX^e/, Then there exist a* G A*, i — 1,2, ..., n, 

such that 

a — — a l + a 2 + ' ■ ‘ + Ofe-l + &fc-fl +•*■•+ a n . 

This implies 

a l + a 2 + ' * ' + Ofe—1 + (~ a k) A Clk +1 + ■ ' ' + &n — 0 . 

Also, 0 —|— 0 —|— • - * —)— 0 — 0. Therefore, by (iii), a* = 0 for all i G I since 0 is 
uniquely expressible as a sum of elements of A*. Thus, A^ flX^e/ i^k ^ — {0} 
and so Ai is a direct sum. ■ 

Definition 13.1.10 A ring R is said to be an internal direct sum of a. finite 
family of ideals {Ai, A 2 ,, A n } if 
(i) R — A\ -(- A 2 T * * ■ T A n and 
(h) A\ + A 2 + • • • + A n is a direct sum.. 

Theorem 13.1.11 Let R be a ring and {Ai j i G 1} be a finite family of ideals 
of R. If R is an internal direct sum of {Ai \ i G I}, then 

R — ®ieiAi. 


Proof. Let I = { 1,2, ... , n}. Suppose R is an internal direct sum of ideals 
A i,A. 2 , -. •, A n . Let a G R. Then a is uniquely expressible in the form a = 
ai + a 2 + • • • + a n , where ai £ Ai, i £ I. Now (ai, af ,..., a n ) £ ® l& jA{. Define 
/ : R -> ©ie/A* by 

f (&) — (®1 j • 1 ^n) • 


Let a,b £ R. Then there exist ai,bi £ Ai, i £ I such that a = ai + <22 + • • • + a n 
and b = b\ + 62 H- b b n . Now a = b if and only if a\ + 02 H- b a n = b\ + 62 + 


- \-b n if and only if = 6 ^ for all i G / if and only if (aj, 02 , ..., a n ) = (fq, 62 , 

..., 6 n ) if and only if /(a) = /( 6 ). This shows that / is a one-one function. 

Let (ai,a 2 , ..., a n ) £ ©i € /A ? . Then a — ai + <22 H - b a n G Xqe/ Ai — R and 

/(a) = (ai,a 2 , ..., a n ). Hence, / is onto ©i e /A^. Finally, we show that / is a 
homomorphism. Since a + b — (ai + 61 ) + (<22 + £> 2 ) H-+ ( a n + 6 n ) 5 we have 


/(a “b 6) ((&1 +^l)t ( a 2 + ^ 2 )) • • • ) ( r* wftlfw y .4 ^ 
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b n ) = /(a) + f(b). By Lemma 13.1.8, for all i, j € /, i / j, aib 3 = 0. From this, 
it follows that ab = a\b\ + 0262 + • • ■ + a n b n . Thus, f(ab) — (ai&i, 0 . 262 , • • -, 
a n b n ) = ( 0 . 1 ,0-2} • • • > On) ( 61 , 62 , ■■■>&«) = /(o)/( 6 ). Hence, / is an isomorphism 
of .R onto proving that R ~ 

If R is an internal direct sum of ideals ^4i, -A 2 ? ■ - •,-An, then we identify R 
with (&i£iAi and we usually write 

R - Ai 0 A 2 © • • ■ © ^In- 

Let us now characterize the direct sum of ideals of a ring R with 1 with the 
help of idempotent elements. 

Theorem 13.1.12 Let R be a ring with 1 and {A\,A 2 , ...,A n ] be a finite 
family of ideals of R. Then R = A\ © A 2 © ■ • • © A n if and only if there exist 
idempotents ei € Ai, i = 1,2, ..., n, such that 

(i) 1 = ei + 62 + • • • + e n , 

(ii) Rei = Ai for all i = 1,2, ..., n, a,nd 

(in) e l e 3 — e 3 e t — 0 for i j. 


Proof. Let R = A\ © A 2 © • • • © A n . Now 1 € it!. Thus, there exist a G Ai, 
i = 1,2, ..., n, such that 1 = ei + e 2 + • • • + e n . Then e* = eie* + e 2 e* + ■ • • + 

ef + • • • + e n Ci. By Lemma 13.1.8, — 0 for all j 7 ^ i. Hence, = ef, i.e., 

is an idempotent for all i = 1,2,..., n. Since G Ai and Ai is an ideal, 
Rei C Ai. Let a G Ai. Then 

o — ol — aei © ae2 © • ■ ■ © oe^ — oe^ G Rei 

since by Lemma 13 . 1 . 8 , ae 3 = 0 for all j 7^ i. Thus, A l C Rei. Therefore, we 

find that Rei = A t . 

Conversely, assume that there exist idempotents e x G A l , i = 1 , 2 , ..., n, 
satisfying the given conditions. Let a G R. Then a = al = a(ei+e2 + - • - + e n ) = 
aei © ae2 + • • • © ae n G Re\ + Re 2 © • ■ • © Re n C A\ © A2 © • ■ • © A n . 
Hence, R — A\ + A2 + • • • + A n . Let us now show that this sum is direct. 
Let a G Ai n (Ai + A2 © • ■ • © Ai —1 © Ai+\ © ■ • • © Ajfi. Then there exist 
Oi, 02,..., a n G R such that o^e^ — o — 01 ei © • ■ • © Gj_ 1 e^_ 1 © a^ie »+i + 
• • • + a n e n . Thus, a = a^i implies that aei — o^e^ = a©* = a and a = 

a\e\ H-b a 2 _iei_i + a*+ie 2 +i 3 -b a n e n implies that aei = aieie* H-b 

ai-\ei-\ei+ Oi+iei+ieH-b a n e n ei = oCH-baO = 0 (since by (iii), e©j = 0 

for i 7^ j). Hence, a = 0 , proving that R = A\ © A2 © ■ • • © A n . ■ 


Let us now consider another type of subring of the complete direct sum 


TlizjRi of a family of rings {R* 


I} $ 


br this 

'iL+LC- 


Het us note that the ma 

TfcafAewtaftca/ 
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7 r*. : n iziRt —» Rk defined by 


ir k {{ai | i£ I}) = a k 

is an epimorphism of the ring II i^iRi onto the ring R k . 7r*. is called the kth 

canonical projection. 

Definition 13.1.13 A subring T ofYli^iRi is called a subdirect sum of the 
family of rings {Ri \ i £ 1} if 7Ti\x (the restriction of n l to T) is an epimorphism, 
ofT onto Ri. We denote T by 0f G/ it^. 

Theorem 13.1.14 A ring S is isomorphic to a subdirect sum, of a family {R z 
| i € 1} of rings if a.nd, only if S contains a family of ideals {Ai \ i £ 1} such 
tha,t Di^jAi = {0}. 


Proof. Suppose S is isomorphic to a subdirect sum of a family {Ri \ i £ 
1} of rings. Then there exists a subring T of Yli^jRi such that S ~ T and 
T = Let a be the isomorphism of S onto T. Then 'Kid : S —> Ri is 

an epimorphism. Let Ai — Ker 7^0:. Then Ai is an ideal of S. Let a £ H^jAi. 
Then ( 71 ^ 0 ;)(a) = 0 for all i £ I. Thus, 7r*(a(a)) = 0, i.e., the ith component of 
a(a) is 0 for all i £ I. Hence, a(a ) = 0. Since a is one-one, a = 0. This proves 
that O ia Ai = {0}. 

Conversely, suppose S contains a family of ideals {A t \ i £ 1} such that 
D i^iAi — {0}. Consider the family {S/Ai \ i £ 1} of quotient rings. Let 
R = TliziS/Ai. Define j3 : S —> R by 

(3(a) - {a + Ai \ i £ 1} 

for all a £ S. Then (3 is a homomorphism. Let a £ S. Now a £ Ker (3 if and 
only if (3(a) =0 if and only if a + Ai = 0 for all i £ I if and only if a e A 2 for 
all i £ I if and only if a £ H i^jAi if and only if a — 0. Therefore, Ker (3 = {0}. 
Thus, (3 is a monomorphism. Let (3(S ) — T. Then T is a subring of R and also 
7r i\x is an epimorphism. ■ 


13.1.1 Worked-Out Exercises 


<0 Exercise 1 An idempotent e of a ring R is called a central idempotent 
if e G C(R). 

Let R be a ring with 1 and e be a central idempotent in R. Show that 

(i) 1 — e is a central idempotent in R ; 

(ii) eR and (1 — e)R are ideals of R\ 

(iii) R = eR 0(1 — e)R. 
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Solution: (i) (1 — e)(l — e) = 1 — e — e + e 2 = 1 — e — e + e = 1 — e. Also, 
for all a £ R, a(l — e) = a — ae = a — ea = (1 — e)a. Hence, 1 — e is a central 
idempotent. 

(ii) Now eR is a right ideal of R. Let a £ R. Then a(eR ) = ( ae)R = ( ea)R 
(since e £ C(R)) = e(aR ) C eR. Hence, eR is also a left ideal. Thus, eR is an 
ideal of R. Similarly, (1 — e)R is an ideal of R. 

(iii) Let a £ R. Then a = ea + a — ea = ea + (1 — e)a £ eR + (1 — e)R. 
Hence, R = eR+ (1 — e)R. Suppose 6 £ eRD (1 — e)R. Then there exist c, d 6 R 
such that b = ec = (1 — e)d. Hence, eb — e 2 c = ec = b and eb = e(l — e)d = 
(e — e 2 )d = (e — e)d = 0 . Thus, b = 0 . As a result, R = eR © (1 — e)R. 

0 Exercise 2 Let A and B be two ideals of a ring R such that R = A 0 B. 
Show that RJA ~ B and R/B ~ A. 

Solution: Let x £ R. Then x can be uniquely expressed as x = a+ 6, where 
a £ A and b £ B. Define / : R —> B by f(x) = b. Clearly / is well defined. Let 
b £ B. Then 6 = 0 + 66 A + B. Hence, fib) = 6, which shows that / is onto B. 
Let x,y £ R. Then there exist ai, <22 6 A and 61,62 £ B such that x = a\ + 61 
and y = a2 + 62- Now z + y = a\ + 61 + <22 + 62 = (ai + 02)+ (61 +62) 6 A + S 
and xy = (ai + 6i)(a.2 + 62) = aia2+ ai&2+ 610-2+ 6162. Since 0162, 6102 6 AnZ? 
and AflH = { 0 }, 0162 = 0 and 6102 = 0. Therefore, xy = 0102+ 6162 £ A + B. 
Hence, /(x + y) = 6 X + 6 2 = /(x) + /(y) and f(xy) = 6 X 6 2 = f(x)f(y). Thus, 
/ is an epimorphism. Therefore, by the first isomorphism theorem (Theorem 
11 . 3 . 14 ), R/Keif ~ B. Let x £ Ker /. Then f{x) = 0 . Since x £ Ker / C R, 
there exist a £ A and b £ B such that x = a + 6. Now f(x) — 6 and this implies 
that 6 = 0 . Therefore, x = a £ A and so Ker / C A. On the other hand, let 
a £ A. Then o = o + 0 eA + H. Therefore, /(a) = 0 and so a 6 Ker /. Thus, 
A C Ker /. Hence, A = Ker / and so R/A ~ B. Similarly, R/B ~ A. 

Exercise 3 Let R — i?i©i?2©- ■ -©i?n he the direct of sum of rings R\, R2 ,..., 
and 1 6 R. Show that an element a — (ai, 02, ..., a n ) £ R is a unit if 
and only if a* is a unit in Ri for all i = 1,2,..., re¬ 


solution: Since 1 6 R = i?i © R2 © ■ • ■ ®Rn, 1 = (ei, e2,..., e n ), where 
is the identity of Ri for alH = 1 , 2 ,..., n. Suppose a = (ai, 02, ..., a n ) £ R is 
a unit. Then there exists 6 = (61,62, ..., b n ) £ R such that ab = 1 = 6a. Thus, 
(01, O2, . . . , CLji) (61,62, ... 5 672) (^l, ^2 j • • ■ j ^72) — (^1} ^ 2 5 • • • 5 ^n) (®1 ? a 2? • • • > 

a n ). From this, it follows that aA = = 6*0* for allf = 1, 2,... ,n. Hence, 

is a unit in Ri for all i = 1 , 2,. .., n. Conversely, assume that a^ is a unit in Ri 
for all i = 1, 2 ,... ,n. Thus, there exists bi £ R4 such that a^6^ = e; = 6^ for 
all z = 1 , 2 , ..., n. Let 6 = (61,62,..., 6 n ). Then a6 = 1 = 60, proving that a is 


a unit. 


“Pi l*ic- TfeoiAewtoZLca/ “PlLySfLc-A- 
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0 Exercise 4 Let R be a direct of sum of rings Ri, R2 ,..., Rn with iden¬ 
tity. Let A be an ideal of R. Show that there exist ideals Ai in R4, 
i — 1,2,..., n, such that A — A\ ® A2 © • • • © A n . 

Solution: For all fc, 1 < k < n, define a*, : ©i^ —* Rk by 

C*k ( ( a l ? • j 0*71 ) ) — O'k 

for all (<21,02, ...,a n ) E ®Ri- It can be easily verified that is an epi- 
morphism. Let a&(A) — Afc. Then Ak is an ideal of R We now show that 
A = A\ 0 A2 ® • • • © A n . Let a = (ai, 02, ..., a n ) € A. Now a*, (a) = a* E Ak. 
Therefore, a E A\ ® A2 ® ■ • • ® A n and so A C A\ © A2 © • ■ • ® A n . Sup¬ 
pose now that b — (£>1,62, • ■ •, b n ) E Ai © A2 © • • • © A n . Then bk E Ak — 
ak(A). Therefore, there exists an element a = (01,02,..., ak-i, bk, afc+i,..., 
an) e A. Now ( 0 , 0 ,..., 0 ,6fc, 0 , — , 0 ) = ( 0 , 0 ,... ,l,..., 0 )(oi,a 2 ,...,afc_i, bk, 
o/b+i,..., a n ) E A for all k = 1,2 Hence, (61, b 2 > • • •, b n ) — (&i, 0 ,..., 

0 )+ ( 0 , 62? • • • > 0 )+ • • • + ( 0 , 0 ,..., b n ) E A showing that A\ ©A2©- • -©A n C A. 
Thus, A = A\ © A 2 © ■ ■ ■ © A n . 

<(> Exercise 5 Let R be a ring with 1 . Suppose that A and B are ideals of R 
such that R — A + B. Show that 

R/{AnB) ~ R/A © R/B. 

(This result is known as the Chinese remainder theorem for rings.) 


Solution: Define f : R —> R/A © R/B by 

f{x) = {x + A, x + B) 

for all x E R. Let x,y E R. Then 

ffo + y) = {{x + y) + A, (x © y) + B) 

- {{x + A) + (y + A), {x + B) + {y + B)) 

= (x + A,x + B) + (y + A,y + B) 

= /0) + /(y)- 

Similarly, ffoy) = f{x)f{y). Hence, / is a homomorphism. Now R = A + B 
implies that 1 = a © b for some a E A and b E B. Thus, a + B — {l —b)-\-B = 
(1 + B) + (—6 + B) = 1 + B since —b E B. Similarly, b + A — 1 + A. Let (x + A, 
y + B) E R/A © R/B. Now xb + ya E R. Therefore, 


ffob + ya) 


{{xb © ya) + A, fob + ya) + B) 

((xb A) -f- (ya + A), fob + B) + {ya + B)) 

{{xb + A )+(0 + A), (0 + B)+{ya + B)) (since a E A, b E B) 
{{xb + A) , {ya + B)) 

{{x + A){b + A), {y + B){a + B)) 

((x + A)(l + A), (y+ £)(! + £)) 


(x + A, y + B). 
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Hence, / is an epimorphism. By the first isomorphism theorem (Theorem 
11 . 3 . 14 ), 

R/Kei / ~ R/AeR/B. 

We now show that Ker f — A (1 B. 


Ker / 


{x 

e 

R | 

f{x) = 0} 



{x 

e 

R 1 

(x + A, x + B) 

— 

(A,B)} 

{x 

e 

R 1 

x + A = A and 

X 

+ B = B} 

{x 

e 

R 1 

x E A and x G 

B 

'} 

{x 

e 

R 

x G A fl B } 



ADB. 





Consequently, R/ (A fl B) ~ R/A © R/B. 

13.1.2 Exercises 

1. Let R = Ri 0 i?2 © • • • © R n be a direct sum of rings. If Ai is an ideal of 
Ri , (1 < i < n), prove that A = A\ © A2 © ■ • • © A n is an ideal of R. 

2. Let R be a direct of sum of rings R\,R2, . ■., Rn with 1 . Let A be an 
ideal of R. Show that there exist ideals Ai of Ri, i = 1,2 ,..., n, such that 
A — A\ © A2 © • • • © A n and 

R/A ~ R1/A1 0 R2/A2 © • • • 0 R n /A n . 


3 . Show that the ring Z cannot be expressed as a direct sum of a finite 
family of proper ideals of Z. 

4 . If m, and n are two positive integers such that gcd(m, n) = 1 , prove that 

2mn — 0 Z n . 
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Chapter 14 

Polynomial Rings 


The study of polynomials dates back to 1650 B.C., when Egyptians were solving 
certain linear polynomial equations. In 600 B.C., Hindus had learned how to 
solve quadratic equations. However, polynomials, as we know them today, i.e., 
polynomials written in our notation, did not exist until approximately 1700 
A.D. 

About 400 A.D., the use of symbolic algebra began to appear in India and 
Arabia. Some mark the use of symbols in algebra as the first level of abstraction 
in mathematics. 


14.1 Polynomial Rings 

An important class of rings is the so-called class of polynomial rings. We are all 
familiar with polynomials. We may be used to thinking of a polynomial as an 
expression of the form ao -\-a\X + • • • +a n x n , where £ is a symbol and the a^ are 
possibly real numbers, or as a function f(x) = ao + a\x -\— ■ + a n x n . However, 
does one really know what a polynomial is? What really is the symbol xl Why 
are two polynomials ao + a\x + • ■ • +a n x n and &o + b\x+ ■ • • + 6 m :r m equal if 
and only if n = m and a* = bi, i = 1 , 2 , ..., n? In this section, we answer these 
questions and give some basic properties of polynomials. 

Definition 14 . 1.1 For any ring R , let R[x] denote the set of all infinite se¬ 
quences (ao,ai,a2, ...), where a z € R, i — 0,1,2, ..., and where there is a 
nonnegative integer n (dependent on (ao,ai,<Z2, • ■ •)) such, that for all integers 
k > n, ~ 0 . The elements of i?[x] are called polynomials over R. 

We now define addition and multiplication on R[x] as follows: 

(ao, ai, 02, • • •) + ( 6 o, h, &2> ■ • •) = (ao + &o,ai + &i,a2 + &2, • • ■) 

(ao, ai, a2, ...)■ (&o, 61,62,...) = (cq,ci,C2, ...), 
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where 


j 

Cj = ^ dibj-i for j = 0,1,2,... 

2=0 

We leave it to the reader to verify that (i?[x], +, •) is a ring. We do note 
that (0,0,...) is the additive identity of i?[x] and that the additive inverse of 
(ao,ai,...) is (— ao, — oi,...). The ring i?[x] is called a ring of polynomials 
or a polynomial ring over R. It is clear that i?[x] is commutative when R is 
commutative. Also, if R has an identity 1 , then R[x] has an identity, namely, 
( 1 , 0 , 0 , 0 ,...). 

The mapping a —> ( a , 0 , 0 ,...) is a monomorphism of R into i?[x]. Thus, R 
is embedded in R[x\. Therefore, we can consider jRasa subring of i?[x] and we 
no longer distinguish between a and (a, 0, 0,...). 

We now convert our notation of polynomials into a notation which is more 
familiar to the reader. 

Let 

a = ax° denote (a, 0,0,...) 
ax = axMenote (0, a, 0,...) 
ax 2 denote (0,0, a,...) 


Then 

(no, ai , <22, • • ■, a n , 0 ,...) (no, 0 , 0 ,...)T( 0 ,ai, 0 , 0 ,...) - | - '‘ , T( 0 , ..., 0 , 
an, 0, ...) = no + nix + a^x 2 H-+ a n x n . 

The symbol x is called an indeterminate over R and the elements no, ni, 
..., a n of R are called the coefficients of ao a\x 4- a2X 2 + • • • + a n x n . 

The reason two polynomials ao + nix + - ■ ■ +a n x n and bo + b\x+ ■ • • +b rn x Tn 
are equal if and only if n — m and n* = &*, i = 1,2, ..., n, is that the two 
sequences (ao, ni, ...) and (bo, b\, . -.) are equal if and only if a t = fy, i = 1,2, 
.... (One must recall that an infinite sequence of elements of R is a function 
from the set of nonnegative integers into R. Consequently, the concept of an 
ordered pair is again being used to give a rigorous definition of a mathematical 
concept.) 

If R has an identity 1, then we can consider x an element of R[x]. We do 
this by identifying lx with x, i.e., (0,1,0, ...) is called x. 

The reader can check that the definitions of addition and multiplication of 
two polynomials are the familiar ones. Thus, when R has an identity, ax = 
(a, 0,0,.. .)(0,1,0,...) = (0, a, 0,...) = (0,1,0, .. .)(a, 0,0, ...) = xa. 


Theorem 14 . 1.2 (%) If R is a commutative ring with 1 , then R[x\ is a com¬ 
mutative ring with 1. 

(ii) If R is an integral domain, then R[x] is also an integral dom,ain. 
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Proof. (i) Let f{x) — ao + aix+ • • • +a n x n and g{x) = bo + bix + ■ ■ ■ +b m x m 
be two elements in R[x\. Let f{x)g(x) = Co + c\x + • • • 4- c t x l and g(x)f(x) = 
do + d\x 4- • • • + d s x s . Now Cj — Yl J i=o a ibj-i and dj — Y%=o bidj-i. Since R is 
commutative, Cj = aobj 4- a\bj-\ + • ■ • 4- djbo = bodj 4- 4- • • • 4- bjdQ = dj 

for all j = 0,1,2, .... Thus, 77[x] is a commutative ring. Since 1 € 77, 1 € R[x] 
and 1 f(x) — /(x) 1 = f(x) for all /(x) G 77 [x]. Hence, R[x] is a commutative 
ring with 1. 

(ii) Let R be an integral domain. Then by (i), /?[rc] is a commutative ring 
with 1. Let f(x) = do 4- dix 4- • ■ • 4 -a n x n and g(x) = bo 4 - b\x+ ■ - • +b Tn x TTl 
be two nonzero polynomials in 7?[x]. Then there exist di and b 3 such that 
di 7 ^ 0, bj ^ 0, d l+t = 0 , and b J+ t = 0 for all t > 1 . Consider the polynomial 

f(x)g(x) = co 4- c\x H-+ c n+m x n+rn . Now c i+j = a 0 b l+ j + aib i+ j-i+ -b 

dibj 4 - • • • 4 - di + jbo — dibj ^ 0 since R is an integral domain. This implies that 
/(x)g(x) / 0. Thus, 77[x] is an integral domain. ■ 

Definition 14.1.3 Let R be a ring. If f{x) = ao 4 - ai x 4 - • • • 4 - d n x n , a n 0, is 
a polynomial in 77 [x], then n is called the degree of /(x), written deg /(x), and 
a n is called the leading coefficient of f(x). If R has an identity and, a n = 1, 
then f(x) is called a monic polynomial. 

The polynomials of degree 0 in 7?[x] are exactly those elements from 77\{0}. 
0 € 7?[x] has no degree. We call the elements of R scalar or constant poly¬ 
nomials. 

Theorem 14.1.4 Let 77 [x] be a polynomial ring and /(x), g{x) be two nonzero 
polynomials in 77 [x]. 

(i) If f(x)g{x) ± 0 , then deg f(x)g(x) < deg /(x) + deg#(x). 

(a) If /(x) 4 -g(x) ^ 0 , then 

deg(/(x) 4 - g(x)) < max{deg/(x), degp(x)}. 

Proof. (i) If /(x) = ao 4- ajx+ • • ■ +a n x n and g{x) = 60 4- &ix+-f & m x m , 

then f(x)g(x) = ao&o + (ao&i 4-ai&o)x4- • • • +a n b rn x nJrTn . If f{x)g{x) ^ 0, then 
at least one of the coefficients of f(x)g(x) is nonzero. Suppose a n b m ^ 0, then 
deg(/(x)< 7 (x)) = n + m = deg/(x) 4- deg#(x). If a n b m = 0 (which can hold if 
R has zero divisors), then deg(/(x)^(x)) < deg/(x)4- deg g{x). 

(ii) If deg/(x) > deg g{x) , then deg(/(x) 4 -g(x)) = max{deg/(x), deg^(x)}. 
If deg/(x) = degg(x), then it is possible that /(x) + g(x) - 0 or deg(/(x) 4 - 
g(x)) < max{deg/(x), deg^(x)}. We leave the details as an exercise. ■ 

From the proof of Theorem 14.1.4(i), it is immediate that if 77 is an integral 
domain, then equality holds in (i). 
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Example 14.1.5 Consider the polynomial ring Ze[^c]- Let f(x) — [1] + [2]x 2 
andg(x) = [l]4-[3]x. Then f(x)g(x) = [1]-h[3]cc-|-[2]cc 2 . Hence, deg (f(x)g(x)) — 
2 < 3 — deg f(x) + deg 5 (x). Let h{x) = [5] + [4]x 2 . Then f(x) + h(x) = 
[6] + [6]x 2 = [0] and so deg (f{x) + h(x)) is not defined. 

Theorem 14.1.6 (Division Algorithm) Let R be a commutative ring with 
1 and f{x ), g(x) be polynomials in R[x\ with the leading coefficient of g(x) a 
unit in R. Then there exist unique polynomials q{x ), rfx) E i2[ar] such that 

f{x) = q{x)g(x) +r(x), 

where either r(x) = 0 or deg r(x) < deg g(x). 

Proof. If f(x) 0 or deg f(x) < deg g(x), then we take q{x) — 0 and 
r[x) = f{x). We now assume that deg f{x) > deg g(x) and prove the result by 
induction on deg f(x) = n. If deg f(x) = deg g{x) = 0, then we have q{x) — 
f{x)g(x )~ 1 and r{x) = 0. Make the induction hypothesis that the theorem is 
true for all polynomials of degree less than n. Let fix) = ao + a\x+ • • • +a n x n 
have degree n and g{x) = bo + b\x + • ■ • + b^x™ have degree m, where n > m. 
The polynomial 


fi{x) = f(x) - (affi^x 71 m g{x) (14.1) 

has degree less than n since the coefficient of x n is a n — {a n b^f) 6 m = 0. Hence, 
by the induction hypothesis, there exist polynomials qi(x ), ri(x) G R[x] such 
that 


fiffi) = qi(x)g(x)+ri(x), (14.2) 

where 7 * 1 ( 2 :) - 0 or deg 7 * 1 ( 2 ;) < deg ( 7 ( 2 ;). Substituting the representation of 
fi(x) in Eq. (14.2) into Eq. (14.1) and solving for fix), we obtain 

f{x) = (qi(x) + a n b^x n - m )g{x) + n(x) = q(x)g{x) + r(x) t 

where q(x) = q\{x) + a n b^f x n ~ m and r( x) = 7 * 1 ( 2 :), the desired representation 
when f[x) has degree n. 

The uniqueness of qix) and r(x) remains to be shown. Suppose there are 
polynomials q\x ) and r'(x) € i2[x] such that 

fix) = q[x)g{x) + r{x) = <f(x)g{x) + r'(x), 

where r(ar) = 0 or deg7*(2:) < deg gffi), r'ffi) = 0 or degr^x) < deg gix). Then 

r(x) - r\x) = iq'Jx) - q{x))g{x). 
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Suppose r{x) — r'ix) ^ 0. Since the leading coefficient of g(x) is a unit, 
deg((g 0 ) - q(x))g{x)) = deg(g(x) - q{x )) + deg 3 ( 1 ) > degg(z). 
This implies that 

deg(r(x) - r'(x)) > deg g(x), 

which is impossible since degr(x), degr'(a;) < degg(x). Thus, 

r{x) — r'(x) = 0 or r(x ) = r'(x). 

Therefore, 


0 = (q\x) - q(x))g(x). (14.3) 

Since b m is a unit, deg(((g / (x) — q(x))g(x)) > 0 unless q'{x) — q{x) = 0. Thus, 
from Eq. (14.3), we see that q'ix) — q{x) — 0 must be the case. ■ 

The polynomials q{x) and r{x) in Theorem 14.1.6 are called the quotient 
and remainder, respectively, on division of f{x) by g(x). 

Definition 14.1.7 Let R be a commutative ring with 1 and f(x) = ao + a\x + 

-h a n x n G For all r <E R, define 

f(r ) = ao + air H-b a n r n . 

Wh.en f(r ) = 0, we call r a root or zero of fix). 

In Definition 14.1.7, we think of substituting r for x in f(x). The student 
is used to doing this freely. However, certain difficulties arise when R is not 
commutative. For instance, let f(x) ~ a — x, g{x) — b — x. Set h(x) — f(x)g(x). 

Then h(x) = (a — x)(b — x) = ab — {a + b)x + x 2 . For c G R, h(c ) = ab 
—(a+ 6 )c+c 2 = ab — ac—bc+c 2 while f(c)g(c) - (a — c)(b — c) = ab — cb—ac+c 2 . 

Hence, we cannot draw the conclusion that h(c ) = f(c)g(c). However, if R 
is commutative (with identity), then we can conclude that h(c ) = f(c)g(c). 

Clearly if k(x) = f(x) + g(x), then k(c) = f(c) + g{c). 

Definition 14.1.8 Let R be a commutative ring with 1 and f(x), g(x) G -R[m] 
be such that g(x) 7 ^ 0. We say that g(x) divides f(x) or that g(x) is a factor of 
f(x), and write g(x)\f(x) if there exists q(x) G R[x] such that f(x) = q{x)g{x). 

Theorem 14.1.9 (Remainder Theorem) Let R be a commutative ring with 
identity. For f(x) G i?[x] a,nd a G R, there exists q{x) G R[x] such that 

f(x) = (x-a)q(x) + f(a). 
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Proof. By applying the division algorithm with x — a — g(x ), there exist 
unique q(x), r(x) E R[x] such that f(x) = (x — a)q(x) + r(x), where r(x) = 0 
or degr(x) < 1. Hence, r(x) is a constant polynomial, say, r(x) = d. By 
substituting a for x, we obtain f(a ) = (a — a)q(a) + d = d, which yields the 
desired result. ■ 

Corollary 14.1.10 (Factorization Theorem) Let R be a commutative ring 
with identity. For /( x) E #[x] and a E R, x — a divides /(x) if and only if a 
is a root of f(x). 


Proof. Suppose (x — a)jf(x). Then there exists q{x) E i£[x] such that 
f(x) = (x — a)q(x). Hence, /(a) = (a — a)q(a) = 0 and so a is a root of 
f(x). Conversely, suppose a is a root of f(x). Then by the remainder theorem 
(Theorem 14.1.9) and the fact that f(a) = 0, we have f(x) = (x — a)q(x). 
Consequently, (x — a)jf(x). ■ 

Theorem 14.1.11 Let R be a.n integral dom,ain and f(x) be a nonzero poly¬ 
nomial in i?[x] of degree n. Then f(x) ha.s a,t m,ost n roots in R. 


Proof. If deg f{x) = 0, then f{x) is a constant polynomial, say, f{x) = c ^ 
0. Clearly c has no roots in R. Assume that the theorem is true for all polyno¬ 
mials of degree less than n, where n > 0 (the induction hypothesis). Suppose 
degf(x) = n. If f(x) has no roots in R , then the theorem is true. Sup¬ 
pose r E R is a root of f{x). Then by Corollary 14.1.10, f(x) = (x — r)q{x ), 
where degg(x) = n — 1. If there exists any other root r' E R of /(x), then 
0 = f(r') = (■ r f — r)q(r’). Since r' ^ r and R is an integral domain, q(r') = 0 
and so r' is a root of q(x). Therefore, any other root of f(x) is also a root 
of q{x). Since f(x) = (x — r)q(x), any root of q(x ) is also a root of f(x). 
By the induction hypothesis and the fact that deg q(x) = n — 1, there are at 
most n —1 of these other roots r'. Hence, in all, f(x) has at most n roots in R. ■ 

We now extend the definition of a polynomial ring from one indeterminate 
to several indeterminates. 

Definition 14.1.12 For any ring R , we define recursively 

R[x i, X 2 , . . . , R^X\ , X2 , • ■ ■ , X n —j j [X 71 ], 

where x\ is an indeterminate over R and x n is an indeterminate over R[x\, 
X 2 , ..., x n _i]. R[x i, X 2 ,.. -, x n ] is called a polynomial ring in n indetermi¬ 
nates. 
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Before describing the ring R[x i,X 2 , •.. ,x n ], we introduce some notation. 


We write for Ei„"=o ' ’ • £i‘=o 

each € R and k \,..., k n are nonnegative integers. 

The ring 


V' 


:^ n , where 


R[xi,x 2 ,. • • ,x n ] = { Y r h- 


r n 

l-n X l 


x 


In 


r h---in 


e R}. 


In )■■■)*! 


We have for n — 2 that 

R[x 1,X 2 ] = R[xi][x 2 \ - {Y s ^ x 2 I s *2 e R[x i]}. 

12 


Now each S{ 2 has the form E^ r^xf;. 

Thus, 

R[xi,x 2 ] = {Ei 2 (En r ili2 x\ l )x% \ r ili2 E R} 

= {Ez 2 Ei, n&xl 1 *? | r ni2 e R} 

{Ez2,n r iii2 X l x 2 I r i\i2 ^ -^•}- 

Definition 14.1.13 Let R be a subring of the ring S. Let c\, C 2 ,..., c n be ele¬ 
ments of S. Define i?[ci] — {E? r ^ c \ I r i £ an d 

l?[ci, C 2 , . . . , Ctx] , C 2 , . . . , C 7 -J. —. 1 ] [c^] . 

We say that c\, C 2 ,..., Cn are algebraically independent over R if 

Y r n...i n c l f ...c% = 0 

^71 | • j^l 

can occur only when each = 0 , where 

i?[ci, C 2 ,..., Cn] is a subring of S and equals the set of all finite sums of the 
form 

Y r h-in<?l ■•• C n > 

*n,-)U 


where € .R. 

Theorem 14.1.14 R be a subring of a commutative ring S such that R and 
S have the same identity. Let c € S. Then there exists a unique hom,omorphism, 
a of R[x\ onto R[c] such that a(x) ~ c and a(a) = a for all a 6 R. 
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Proof. Define a : i?[x] —> R[c\ by a(J2 a iX l ) ~ Y2 a i cl for all J2 a iX l € R[x\. 
Now ao + cl\X + • • • + a n x n = bo 4- b\X + • • • + 6 m x m implies that n = m and 
di = b{ for i = 1,2,..., n. Thus, ao + aic + ■ • • + a n c n = &o + b\c + ■ ■ • + b n c n 
and so a is well defined. By Definition 14.1.13, a clearly maps R[x] onto 
R[c]. Since for any two polynomials f(x), g(x ) G R[x], k(x) = f(x) + g(x) 
implies k(c) = /(c) + g(c) and h(x) = f(x)g(x) implies h(c) = f(c)g(c), it 
follows that a preserves + and *. Therefore, a is a homomorphism of R[x] onto 
i?[c]. Clearly a(x) = c and a(a ) — a for all a G R. Let (3 be a homomor¬ 
phism of i?[x] onto R[c] such that (3{x) — c and /3(a) — a for all a G R. Then 
(3(J2 diX 1 ) = J2 {3( a i)P( x Y — a i° l = a (Yl a ix 1 )- Thus, (3 — a so a is unique. ■ 

We emphasize that a is well defined in Theorem 14.1.14 because x is alge¬ 
braically independent over R. We illustrate this in the following example. 

Example 14.1.15 Define a : Q[\/2] — 3 ► Q[z] by a(X) a iV / 2) — Y) a iX l - Then a 
is not a function since a(2) = 2 and a(2) = a((v // 2) 2 ) = x 2 , but 2 x 2 . 

14.1.1 Worked-Out Exercises 
0 Exercise 1 Let R be a ring with 1. Show that 

R[x\/ (x) ~ R. 


Solution: Define f : R[x] —> R by 

/(®o T Q>\x T d 2 X 2 + • • • + a n x n ) — ao 

for all ao + a\x + a 2 X 2 + • ■ ■ 4- a n x n G i?[x]. Suppose that ao + a\x + a 2 X 2 + 
• • • + a n x n — bo + b\x + b 2 X 2 + • • • + 6 m i Tn . Then ao = bo and so /(ao + a\x + 
a 2 X 2 + • ■ • + a n x n ) = /(&o + b\x + b 2 X 2 + • • • 4- 6 m i m ). Thus, / is well defined. 
Clearly / is an epimorphism. Now ao 4- a\X + a 2 a: 2 T • ■ • + a n x n G Ker / if and 

only if /(ao + a\x + a 2 X 2 H-+ a n x n ) = 0 if and only if ao = 0 if and only if 

ao + aix + a 2 X 2 + • ■ • + a n x n G (x) . Therefore, Ker / = (x) . Thus, 

R[x\f (x) ~ R. 

Exercise 2 Let F be a field and a : F[x\ —> F[x] be an automorphism such 
that a(a ) = a for all a G F. Show that a(x ) = ax + b for some a, b G F. 


Solution: By the division algorithm, a(a;) = g(x)x+b for some g(x) G F[x] 
and b G F. Since a is onto F[x], there exist h(x),p(x) G F[x] such that g(x) = 
a(h(x)) and x = a(p(x)). Therefore, a(x) = g(x)x + b — a(h(x))a(p(x)) + 
a(b) = a(h(x)p(x ) -t- b). Thus, x = h(x)p(x) + b since a is one-one. Now 


deg(z:) = deg (h(x)p(x) + 6) implies t hat deg (h(x)p(x)) = 1 . Hence, eith 

Mai4c#fia£uiaZ rJ 
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degh(x) = 1 and degp(x) = 0 or deg h(x) — 0 and deg p(x) = 1 . Suppose 
degp(x) = 0. Then p(x) = c for some c E F. This implies that x = a(p(x)) = 
a(c ) = c, which is a contradiction. Therefore, degh(x) = 0 and degp(x) = 1 . 
Let h{x) = a for some a E F. Thus, a(x) = a(h(x))x + b = a(a)x + b = ax + b. 

0 Exercise 3 Let R be a commutative ring with 1 and /(x) = ao + a\x + 
ci 2 X 2 + • • • + a n x n E i?[x]. If ao is a unit and a 1} a, 2 ,..., a n are nilpotent 
elements, prove that f(x) is invertible. 

Solution: We prove this result by induction onn = deg f(x). If n — 0, 
then f(x) — ao- Hence, f(x) is invertible. Assume that the result is true 
for all polynomials of the above form and degree < n. Suppose now f(x) = 
ao + a\X + a^x 2 + • • • + a n x n € R[x) such that ao is a unit and ai, a 2 ,.. -, a n 
are nilpotent elements and deg/(x) = n. Let g(x) = ao + a\x + 02 a : 2 + ■ • • + 
a n _ix n_1 . Note that degp(x) < n. Hence, by the induction hypothesis, g{x) 
is invertible. Since a n is nilpotent there exists a positive integer m such 
that a™ = 0. Then (g(x) + a n x n )(g(x)~ 1 — a n g(x)~ 2 x n + a^g{x)~ 3 x 2n — • • • + 
(—1 ) m_1 a^ 1 g(x)~^ m ~ 1 ^x^ m ~ lS)n ) = 1. It now follows that f(x) is invertible. 

14.1.2 Exercises 

1. If I is an ideal of a ring R, prove that I[x\ is an ideal of the polynomial 
ring i?[x]. 

2. Let R be an integral domain. Prove that R and R[x] have the same 
characteristic. 

3. Let I? be a commutative ring with 1. Describe, (x) , the ideal of i?[x] 
generated by x. 

4. (i) Let f(x) = x 4 + 3x 3 +2x 2 + 2 and g(x) = x 2 + 2 x + 1 6 Q[x]. Find the 
unique polynomials g(x), r(x) E Q[x] such that /(x) — q(x)g{x) + r(x), 
where either r(x) = 0 or 0 < degr(x) < degp(x). 

(ii) Let /(x) = x 4 + [3]x 3 + [2]x 2 + [2] and g(x) — x 2 + [2]x + [1] E Zs[xj. 
Find q(x), r{x) E Zs[x] such that /(x) = q{x)g{x) + r(x), where either 
r(x) = 0 or 0 < degr(x) < degp(x). 

5. Let /(x) = x 5 + x 4 + x 3 + x + [3], g{x) — x 4 + x 3 + [2]x 2 + [2]x E Zs[xj. 
Find < 7 (x),r(x) E Zs[x] such that /(x) = q{x)g{x) +r(x), where either 
r(x) — 0 or 0 < deg r(x) < deg g{x). 

6 . Let R — Z 0 Z. Show that the polynomial (l,0)x in i?[x] has infinitely 
many roots in R. 

7. Show that the polynomial ring Z^x] over the ring Z 4 is infinite, but Z^x] 
is of finite characteristic. 
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8. In the ring Zs[x], show that [l] + [2}x is a unit. 

9. Let R be a commutative ring with 1 and f(x ) = ao + aix + • • • + a n x n G 

i?[x]. If f(x) is a unit in prove that ao is a unit in R and ai is 

nilpotent for alH = 1, 2,..., n. 

10. Use the result of Exercise 9 to show that 1 + 5x is not a unit, in Z[x\. 

11. Find all units of Z[x\. 

12. Find all units of Zg[x]. 

13. Let R be an integral domain. Prove that the units of _R[x] are contained 
in R. 

14. In Zq[x], prove the following. 

(i) [4]x 2 + [2}x + [4] is a zero divisor. 

(ii) [2]x is nilpotent. 

(iii) [4}x + [1] and [4]x + [3] are units. 

15. Let R be a subring of a commutative ring S such that R has an identity. 

(i) In the polynomial ring R[x\, X 2 , ■ . •, x n ]> prove that x\, X 2 ,..., x n are 
algebraically independent over R. 

(ii) Prove that the mapping 

Oi . R [xi } X2-, • ■ • i X 7 j] * j C 2 , . . . , Cjx] 

defined by a( x n) = E^...^ c \ ■■■ c n is a 

homomorphism of R[xi ,..., x n ] onto i£[ci,..., c^], where C \,..., c n G S. 

(iii) Prove that the homomorphism a in (ii) is an isomorphism if and only 
if ci, C 2 , ..., c n are algebraically independent over R. 

16. Let /(x) be a polynomial of degree n > 0 in a polynomial ring K[x\ over 
a field K. Prove that any element of the quotient ring K[x)/ ( f(x )) is of 
the form g(x) + ( f(x )) , where g{x) is a polynomial of degree at most 
n — 1. 

17. For the following statements, write the proof if the statement is true; 
otherwise, give a counterexample. 

(i) If a polynomial ring R[x] has zero divisors, so does R. 

(ii) If J? is a field, then R[x] is a field. 

(iii) In Z?[x], (x + [l]) 7 = x 7 + [1]. 
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Chapter 15 

Euclidean Domains 


We have seen that both rings Z and F[x], F a field, have a Euclidean or 
division algorithm. Because of the significance of these rings and the power of 
this common property, the concept of a division algorithm is worth abstracting. 


15.1 Euclidean Domains 

Definition 15.1.1 A Euclidean domain (E,+, v ) is an integral domain 
(E, +, •) together with a function v : E\{0} —> Z# such that 

(i) for all a, b £ E with 6^0, there exist q, r £ E such that a = qb + r, 
where either r = 0 or v{r) < v(b) and 

(ii) for all a, b £ E\{0}, r;(a) < v(ab). 
v is called a Euclidean valuation. 

The next two results show that the ring Z and the polynomial ring F[x], 
F a field, are Euclidean domains. 

Example 15.1.2 The ring Z of integers can be considered a Euclidean domain 
with via ) = \a\, a / 0. 

Theorem 15.1.3 If F is a field, then the polynomial ring F[x] is a Euclidean 
dom,ain. 

Proof. By Theorem 14.1.2(h), F[x\ is an integral domain. Define 

v : F[x]\{0} —> Z# 


by 


v(f(x)) = deg/(;r) 
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for all f(x) G F[x]\{0}. Since deg f(x) > 0, v(f(x)) G Z# for all /(x) G 
FfxjXiO}. Let /(x),p(x) G -F[x], g(x) 7 ^ 0. By Theorem 14.1.6, there exist 
g(x),r(x) G F[x] such that 

/(x) = q{x)g{x) + r(x), where either r(x) = 0 or degr(x) < degg(x). 
Hence, 

f(x) = q{x)g{x) + r(x), where either r(x) = 0 or v(r(x)) < v(g(x)). 

Let f{x) = ao+aixH- \-a n x n , a n / 0 and g(x) = bg+bix-i -h 6 m x m , 6 m 7 ^ 0. 

Then f(x)g(x) = aobo + (ao&i + ai&o)x + ■ * • + a n b m x n+m . Since F is a field 
and a n 7 ^ 0, b m 7 ^ 0, we find that a n b m 7 ^ 0. This implies that deg(/(x)g(x)) = 
n+m. Thus, v(f(x)) = deg(/(x)) = n < n+m = deg(/(x)s(x)) = v(f(x)g(x)). 
Hence, JF[x] is a Euclidean domain. ■ 

Example 15.1.4 Any field can be considered as a Euclidean domain with 
v(a) = 1 for all a 7 ^ 0. (a — (a6 _1 )6 + 0.) 

Definition 15.1.5 The subset Z[i) = {a+bi \ a, b G Z} of the complex numbers 
is called the set of Gaussian integers. 

In the next theorem, we show that Z[i] is a subring of C and determine the 
units of Z[i]. Gauss was the first to study Z[i] and hence in his honor Z[i] is 
called the ring of Gaussian integers. 

Theorem 15.1.6 The set Z [i] of Gaussian integers is a subring of C. The 
units of Z[i] are ±1 and ±i. 

Proof. It is easily verified that Z[i\ is a subring of C. Since C is a field, 
Z [i] is of course an integral domain. Suppose a + bi is a unit of Z[i\. Then 
there exists c + di G Z[i] such that (a + bi) (c + di) = 1. This implies that 1 = 
1 = (a + bi) (c + di) =(a + bi) (c + di) — (a — bi) (c — di) , where the bar denotes 
complex conjugate. Thus, 1 = (a 2 + b 2 )(c 2 + d 2 ) and therefore 1 = a 2 + b 2 . 
Hence, a = 0, 6 = ±1, or a = ±1, 6 = 0, proving that the only units of Z[i] are 
± 1 , ±i. M 

Theorem 15.1.7 The ring Z[i] of Gaussian integers becom.es a Euclidean do¬ 
main when we let the function, 

N : Z[i]\{0} -> z * 

defined by N(a + bi) = (a + bi)(a — bi) — a 2 + b 2 for all a,b G Z, serve as the 
function v. 
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Proof. Clearly N(a+bi) is a positive integer for any nonzero element a+bi E 
Z[i\. Let a + bi, c-\-di E Z[zj. Now N((a + bi)(c+di)) = N(ac — bd+ ( bc+ad)i ) = 
(ac — bd ) 2 + (6c + ad) 2 = (a 2 + 6 2 )(c 2 + d 2 ) = N(a + bi)N(c + di). From this, 
it follows that N(a + bi) < TV ((a + 6T)(c + di)). 

It remains to be shown that for a + bi and c -f di ^ 0 in Z[i], there exist 
<70 + <7ib r 0 + ni E Z[i] such that 

a + bi = (go + gi i)(c + di) 4- (ro + r\i), 

where r*o + r\i = 0 or N(tq + r\i) < N{c + di). We work backward in order to 
see how to choose qo + qii- If such an element go + q\i exists, then in C 

r 0 + r\i = (a + bi) - (c + di)(q 0 + q\i) 

= (c + di)[(a + bi)(c + di ) -1 — (go + q\i)\- 

Let (a + bi){c + di)~ l = u + vi, where u and v are rational numbers. Then 

r 0 + r\i = (c + di) [(u + vi) - (g 0 + q\i)\ 

= (c + di)[{u - q 0 ) + (v - qi)i\ 

= {c{u - go) - d(v - gi)] + [c{y - gi) + d(u - g 0 )]L 

Now 

N(r 0 + rii) = [c{u - g 0 ) - d{y - gi)] 2 + [c{v - gi) + d(u - g 0 )] 2 
= (c 2 + d 2 )[(u - g 0 ) 2 + {v - gi) 2 ]. 

Hence, N(ro +r\i) < N(c + di) if (u — go) 2 + (v — gi) 2 < 1. We now find an 
element go + gN £ Z[z] so that the latter inequality holds. Take integers go and 
gi such that (u — go) 2 < \ and (v — gi) 2 < \. Then (u — go) 2 + (v — gi) 2 < 1. 
Let 

r 0 + r\i = (a + bi) - (c + di)(q 0 + gN). 

Then a + bi = (c+di)(qo + qii) + (ro + rii), where ro + ri? = 0 or N(ro + rii) < 
N(c + di).M 

We now consider the ideals of a Euclidean domain. 

Recall that an ideal I of a ring R is called a principal ideal if I = (a) for 
some a E I. 

Definition 15.1.8 Let R be a commutative ring with 1. If every ideal of R is 
a principal ideal, then R is called a principal ideal ring. An integral dom,ain 
which is also a principal ideal ring is called a principal ideal domain (PID). 

Theorem 15.1.9 Every Euclidean dom,ain is a principal ideal domain. 
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Proof. Let E be a Euclidean domain with Euclidean valuation v. We want 
to show that every ideal of E is a principal ideal. Let I be an ideal of E. Since 
E is a commutative ring with 1 , it is enough to show that / = Ea for some 
a E E. If I is the zero ideal, then I = EO. Suppose now I / {0}. Then I con¬ 
tains some nonzero element. Let P = {^(T) | 0 ^ i € I}. This is a nonempty 
subset of the nonnegative integers. By the well-ordering principle, we find that 
P contains a least element. Therefore, there exists an element a E /, a / 0 
such that v(a ) > 0 and r/(a) < v(b) for all b E /, b / 0. We now show that 
I = Ea. Since I is an ideal and a E /, it follows that Ea C /. Let b E I. Since 
E is a Euclidean domain, there exist q,r E E such that b = aq -j- r, where 
r — 0 or v{r) < v(a). Now r — b — qa E I. If r 7 ^ 0, then v(r) E P. This is a 
contradiction of the minimality of v(a) since v[r) < v(a). Therefore, r = 0 and 
so b = qa E Ea. This proves that I C Ea. Hence, I — Ea. ■ 

By Theorem 15.1.9, Z, Efx] (F a field), and Z[i] are principal ideal domains. 

Theorem 15.1.10 Let R be a comm.uto.tive ring with 1. The following condi¬ 
tions are equivalent. 

(i) R is a field. 

( 11 ) R[x\ is a. Euclidea,n domain. 

(in) i?[rr] is a PID. 

Proof. (i)=>(ii) Follows from Theorem 15.1.3. 

(ii) =^(iii) Follows from Theorem 15.1.9. 

(iii) =^(i) Let a E R and a ^ 0. Consider / = (a, x), the ideal of i?[x] 
generated by a and x. Since i?[x] is a PID, there exists f(x) E such that 
I = {/(z)) • Now a,x E {/(x)) . Therefore, there exist g(x) and h{x) in R[x] 
such that f(x)g(x) = a and f(x)hfx) — x. Since f{x)g{x) — a, we must have 
deg f{x) = 0 and so f(x) E R. Let f(x) — b. Now bh{x) = x implies that 
be = 1 for some c E R. Thus, b is a unit and so I = (b) = i?[z]. From this, 
we have 1 El. Therefore, 1 = af\(x) 4- xf 2 {x) for some /i(x),/ 2 (x) E R[x\. 
This implies that 1 = da for some d E R. Hence, a is a unit in R and so R is a 
field. ■ 

Corollary 15.1.11 Z[x] is not a PID. 


Proof. Now Z is a commutative ring with 1. Since Z is not a field, Z[x\ is 
not a PID by Theorem 15.1.10. ■ 


We conclude this section with the following remark. 

Remark 15.1.12, Consider Z[\/—19] = {a + by/— 19 | a,b E Z and a and. b 
are either both, even or both. odd}. It is known that Z[\/—19] is a principal ideal 
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domain, but not a Euclidean domain. The proof of this result is beyond the 
scope of this book. However, the interested reader can find the proof in, J. C. 
Wilson, 11 A principal ideal ring that is not a Euclidean , ring, ” Mathematics 
Magazine 46(1973), 34 — 38. 

15.1.1 Worked-Out Exercises 

0 Exercise 1 Let (E,+,-,v) be a Euclidean domain. 

(i) Show that v(a ) = v(—a) for all a £ £\{0}. 

(ii) Show that for all a £ £\{0}, v(a) > v(l), where equality holds if and 
only if a is a unit in E. 

(iii) Let n be an integer such that i>(l) + n > 0. Show that the function 

v n : £\{0} Z # 

defined by v n {a) — v (a) + n for all a £ E\{0} is a Euclidean valuation. 

Solution: (i) For all a £ £^{0}, v(a ) = u((—1)(—a)) > v(—a) = v(( — l)a) 
> r;(a). Hence, v(a) = u(~a) for all a £ £\{0}. 

(ii) Let a £ £\{0}. Now v(a) = v(la) > v(l). Suppose a is a unit. Then 
there exists an element c £ E such that ac = 1. Thus, v(l) = v{ac) > v(a). 
This implies that v(a) = t>(l). Conversely, suppose that v(a) = v(l). Since 
a ^ 0, there exist q,r £ E such that 1 = qa + r, where r = 0 or v{r) < 'u(l). 
Now v(r ) < i>(l) is impossible. Hence, r = 0, showing that 1 = qa. Thus, a is 
a unit. 

(iii) Let a £ £\{0}. Then v n (a) = v(a) T n > v(l) + n > 0. Hence, 
v n ( a ) £ Z*. Suppose a,b £ E with & ^ 0. There exist q,r £ E such that 
a = qb + r, where either r = 0 or v(r) < v(b). Now v(r) < v(b ) implies that 
v(r) + n < v(b ) + n. Thus, v n (r) < v n (b). Also, for a,b £ £\{0}, v n (ab ) = 
v(ab) T n > t;(a) + n — v n {a). Therefore, v n is a Euclidean valuation on E. 

0 Exercise 2 Let n be a square free integer (an integer different from 0 and 1, 
which is not divisible by the square of any integer). Let Z [y/n\ — {a+b^Jri, 
| o,6 £ Z}. Show that 7f[^fn\ is an integral domain. Define a function 
N : Z [y/n\ -+ Z # by 

N(a + byfn) = (a + by/n){a — by/n) = a 2 — nb 2 . 

(i) Let x £ r L\\Jn\. Prove that N(x) = 0 if and only if x = 0. 

(ii) Prove that N(xy) = N(x)N(y) for all x,y £ Z [\fn\. 

(iii) Let x £ Zf-^/n]. Prove that N{x) — ±1 if and only if x is a unit in 

z[vH 
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Solution: Let x = a+by/n and y — c+dy/n be two elements in Z [y/n\. Now 
x — y — (a — c) + (b — d)y/n G Z [y/n\ and xy = (ac+n 6 d) + (ad+ 6 c) N /n G Z[y/n\. 
We have 0 = 0 + 0%/n 6 Z[y"n] and 1 = 1 + 0%/n G Z[%AT|. Now it is easy to 
verify that Z[%/n] is an integral domain. 

(i) Let x = a + by/n. Then N(x) = a 2 — nb 2 . Suppose N(x) = 0. If b — 0, 
then a = 0. If b / 0, then n = %: — (§) 2 , which is a contradiction to the 
assumption that n is a square free integer. Therefore, a = 0 and 6 = 0. Thus, 
x = 0. The converse is trivial. 

(ii) Let x = a + by/n and y = c + d%/n. Now 

N(xy) — [(ac + nbd) + (ad + bc)y/n\[(ac + nbd) ~ (adbc)y/n\ 

= (ac + n 6 d ) 2 — (ad + 6 c) 2 n 
= a 2 c 2 + n 2 b 2 d 2 — a 2 d 2 n — b 2 c 2 n 
— (a 2 — nb 2 )(c 2 — nd 2 ) 

= N(x)N(y). 

(iii) Let 2 = a + 6 %/n. 7 V( 2 ) = ±1 if and only if (a + by/n) (a — 6 %/n) = ±1 
if and only if a + by/n, divides 1 , i.e., if and only if a + by/n is a unit in Z[%/n]. 

<0 Exercise 3 Show that Z[%/n| is a Euclidean domain for n — —1, —2,2,3. 

Solution: By Worked-Out Exercise 2 (page 349), Z[%/n] is an integral 
domain. Define v : Zl[%/n]\{0} —» Z# by v(a + by/n) = | N(a + 6 %/n)| , where N 
is defined as in Worked-Out Exercise 2. Let a + by/n , c + dy/n G Z[%/n]\{0}. 

Now _ _ __ 

v((a + 6 %/n)(c + dy/n)) — \N((a + by/n)(c + d%/n))| 

= \(a 2 — nb 2 )(c 2 — nd 2 )\ 

= |(a 2 — n 6 2 )| |(c 2 — nd 2 )\ 

> | (a 2 — n 6 2 ) | 

= v((a + by/n)). 

Let a + 6 %/n, c + d%/n G Z[%/n] with c + d%/n / 0. We want to show that there 
exist < 7 o + < 71 %/^, ro + T\y/n G Z[%/n] such that 

a + 6 %/n = (c + dy/n) (qo + gi%/n) + (ro + r 1 %/n), 

where either ro + r\yfn = 0 or |(r 2 — nr 2 )| < |(c 2 — nd 2 )| . We work backward 
in order to see how to choose <70 + q\ %/n. If such an element qo + <71 %/n exists 
in Z[%/n], then in Q[%/n] 

ro + r\y/n — (a + by/n) - (c + dy/n,)(q 0 + gi%/n) 

= (c + d%/n)[(a + by/n)(c + d%/n) _1 - (<70 + gi%/n)]. 

Let (a + 6 %/n)(c + dy/n) -1 = u + u%/n, where u and v are rational numbers. 
Then 

ro + riy/n = (c + dy/n)[(u + Vy/n) - (qo + qi^/n)} 

= (c + dy/n )[(u - q 0 ) + (v - <?i)%/n] 

= [c(n - g 0 ) + d(v - qi)n\ + [c(v - qi) + d(u - go)]%/n. 
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Now 

v(ro + riy/n) = \ [c(u - g 0 ) + d(y - q\)n] 2 - [c(v - gi) + d{u - g 0 )] 2 n| 

= |(c 2 - nd 2 )[(u - g 0 ) 2 - n(v - gi) 2 ]| 

< | (c 2 — nd 2 ) | 

if |(n — go) 2 — n ( v ~ gi) 2 | < 1* We now find an element go + gi\/^ £ Zfy'n] such 
that | (u — go) 2 — n ( v — gi) 2 | < 1- Take integers go and gi such that (u— go) 2 < \ 
and (v — gi) 2 < \. For n = — 1 or —2, 

0 “ ?o) 2 - n{v - gi) 2 < ^ + (^n)i < 1. 

For n = 2 or 3, 

-^ < (u - go) 2 - n(v - gi) 2 < 

Then |(u — go) 2 — n(i> — gi) 2 | < 1 for n = —1, —2, 2 or 3. Hence, there exist 
go + giV™> r o + nV™ £ Zf-^/n] such that 

a + 6v^ = (c + dV^)(go + gi\/n) + (ro + ri^/n), 

where either ro + ri>/n = 0 or |(rg — nr 2 )| < |(c 2 — nd 2 ) \ . 

<0 Exercise 4 Let Z[iV3] = {a + 6z\/3 | a, b £ Z}. Show that Z[i\/3] is an 
integral domain. Define v : Z[i\/^]\{0} —> Z# by u(a + biy/ 3) = a 2 + 36 2 . 
Show that v is not a Euclidean valuation on Z[z\/3]. 

Solution: Proceeding as in Worked-Out Exercise 2 (page 349), we can 
show that Z[i\/3] is an integral domain. Suppose v is a Euclidean valuation. 
Now 2 and 1 + i\J 3 are elements of Z[iV3]- Suppose there exist go + gN\/3> 
r 0 + r\i \/3 £ Z[f\/3] such that 

2 = (1 + n/3)(go + qiiVs) + (r 0 + rjiV3), 

where either ro + rN\/3 = 0 or rg + 3r 2 < 4. If ro + r\i\J 3 = 0, then 

2 = (1 + fv / 3)(go + qiiVs). 

This implies that 

4 = v (2) = r>((l + iVS)(qo + qiiVs)) = 4(g 2 + 3g 2 ). 

Then g 2 +3g 2 = 1, which shows that go = ±1, gi = 0. As a result, 2 = l+i\/3 or 
2 = — (1 +i\/3), a contradiction. Suppose now rg + 3r 2 < 4. Then rg + 3r 2 = 1, 
2, or 3. Since ro and ri are integers, r 2 + 3r 2 ^ 2. Suppose r 2 + 3r 2 = 1. Then 
ro = ±1, ri = 0. Thus, 

2 = (1 + iy/3)(qo + qiiy/3) + (r 0 + riiVS), 
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whence 

2 = g 0 - 3gi + r 0 

and 

0 = q\ +qo +n- 

If ro = 1 and r\ = 0, then go — 3gi = 1 and gi + go = 0. This implies that 
—2gi = 1, which is impossible. Similarly, for each remaining case we can show 
a contradiction. Also, from 7 ~q + 3 r\ — 3, we can show a contradiction. Hence, 
v is not a Euclidean valuation on Z[z\/3]. 

15.1.2 Exercises 

1. Show that the mapping v : Z\{0} —► N defined by r(a) = \a\ n for some 
fixed positive integer n is a Euclidean valuation on Z. 

2. In Z[\/3], for 9 + 5\/3 and 1 + 7y/S, find go + qiVS, tq + r\yf \3 G Z[\/3] 
such that 

9 + 5y/3 = (go + giV3)(l + 7y/3) + tq + T\y/ 3, 
where either ro + r\y/Z — 0 or \tq — 2>r\\ < 146. 

3. Consider the integral domain Z[i]. Find go + gib rQ-\-r\i € Z[i] such that 

3 4- 7i = (go + giz)(l + 2 i) + ro + nz, 
where either ro + r\i = 0 or |rg + r \| <5. 

4. Let a = 3 + 8z, 6 = —2 + 3z G Z[z]. Find c ,d = x + yi in Z[z] such that 
a = bc-\- d, where either d = 0 or x 2 + y 2 <9. 

5. Let / : R —> S be an epimorphism of rings. If R is a principal ideal ring, 
prove that S is also a principal ideal ring. 

6. Prove that the ring Z n is a principal ideal ring for all n G N. 

7. Which of the following statements are true? Justify. 

(i) (Z, +, -,v) is a Euclidean domain, where v(n) = n 2 for all n > 0. 

(ii) (Q, +, ■, v) is a Euclidean domain, where v(2) = ^ for all | ^ 0. 

(iii) If a ring R is a PID, then every subring of R with identity is a PID. 
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15.2 Greatest Common Divisors 

Definition 15.2.1 Let R be a commutative ring and a, b 6 R be such that 
a 7^ 0. If there exists c £ R such that b = ac , then a is said to divide b or a is 
said to be a divisor of b and we write a\b. 

When we write a|6, we mean that a^O and a divides b. The notation a / 
b will mean that a does not divide b. 

Let R be a commutative ring with 1. By Definition 15.2.1, the following 
results follow immediately. For all a, 6, c 6 R, 

(i) a|a, l|a and a|0, 

(ii) a is a unit if and only if a|l, 

(iii) if a\b and b\c ) then a|c. 

Definition 15.2.2 Let R be a commutative ring with 1. A nonzero element 
a € R is said to be an associate of a nonzero element b 6 R if a = bu for 
som,e unit u E R. 

Example 15.2.3 (i) In Z, 1 and —1 are the only units. For every a E Z, a 
and —a are associates. 

(ii) /nZ[i], 1,-1, i,—i are the only units. Thus, 1 + z, — 1 — i, —1 +i, 1 —i 
are all associates of 1 + i. 

Example 15.2.4 In the polynomial ring F[x] over a field, F, the units form, the 
set F\{0}. A nonconstant polynomial f(x) has uf(x) for an associate, where 
u is a, unit in F. 

Theorem 15.2.5 Let R be a commutative ring with 1 and a,b,cE R. 

(i) If a is an associate of b , then b is an associate of a. 

(ii) If a is an associate ofb and b is an associate of c, then a is an associate 
of c. 

(iii) Suppose R is an integral domain. Then a is an associate of b if and 
only if a\b and b\a. 

(iv) Suppose R is an integral domain. Then a and b are associates of each 
other if and only if (a) = (b) . 

Proof. (i) This result follows from the fact that the inverse of a unit is also 
a unit. 

(ii) This result follows from the fact that the product of two units is also a 
unit. 

(iii) Suppose a is an associate of b. Then a = bu for some unit u £ R. This 
implies that 6 — an -1 . Hence, a|6 and b\a. Conversely, suppose that a\b and 
b\a. Then there exist q\, q 2 G 1? such that a = q\b and 6 = q 2 a. Thus, b = q 2 q\b 
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and so 1 = q 2 q\ by cancellation. This implies that q\ and q 2 are units and so 
a and b are associates. 

(iv) The result here follows from (iii) and the fact that (a) = {q 2 d \ q 2 € R} 
and ( b) = {q\b \ qi E -R}. ■ 

We now introduce the notion of a greatest common divisor in a commutative 
ring. 

Definition 15.2.6 Let R be a commutative ring and a\, < 22 ,..., a n be elements 
in R , not all zero. A nonzero element d E R is called a common divisor of 
< 21 , < 22 ,..., a n if d\ai for all i = 1,2, ..., n. A nonzero element d E R is called 
a greatest common divisor (gcd) of ai, a 2 ,..., a n if 

(i) d is a comm,on divisor of a\, < 22 ,..., a n and 

(ii) if c E R is a comm,on divisor 0 / ai, < 22 ,..., a n , then c\d. 

The greatest common divisor (gcd) of two elements need not be unique. In 
fact, the gcd of two elements may not even exist. 

Example 15.2.7 Consider the ring Z 10 . Then [4] = [4] [ 6 ] and [ 6 ] = [4] [4]. 
This shows that [4] and [ 6 ] are comm,on divisors of each other. Hence, [4] 
and [ 6 ] m.ust be greatest comm,on divisors of [4] and [ 6 ]. Now [4] and [ 6 ] are 
associates since [9] is a unit and [ 6 ] = [9] [4]. 

Example 15.2.8 In the ring E of even integers, 2 has no divisor. Hence, 2 
and no other even integer ca,n have a common divisor. 

Example 15.2.9 In a field F , a\b and. b\a for all a,b £ F with a / 0 and 
b 7 ^ 0. Thus, every nonzero element is a gcd, of any pair of elements. 

The next result shows that in a principal ideal ring, every pair of elements 
not both zero has a gcd. 

Theorem 15.2.10 Let R be a principal ideal ring and a, b E R not both zero. 
Then a and b have a gcd d. For every gcd d of a and 6 , there exist s, t E R 
such that d — sa + tb. 

Proof. The ideal (a, b) of R must be a principal ideal, whence there exists 
d E R such that (a, 6 ) = (d ). Thus, there exist u,v E R such that a = ud and 
b = vd. Therefore, d is a common divisor of a and b. Since d E (a, b) , there 
exist s,t E R such that d = sa + tb. Now suppose c is any common divisor 
of a and b. Then there exist u',v' E R such that a = u'c and b = v'c. Thus, 
d = ( su' + tv')c and so c\d. Hence, d is a gcd of a and b. Let d' be any gcd 
of a and b. Then d\d' and d'\d, whence (d') = (d) = (a,b) . Thus, there exist 
s' ,t' E R such that d f = s'a + t/b. ■ 
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Corollary 15.2.11 Let R be a Euclidean domain and, a,b 6 R, not both zero. 
Then a and b have a gcd d. For every gcd, d of a and 6, there exist s, t E R 
such that d = sa + tb. 

Proof. Since every Euclidean domain is a principal ideal ring, the corollary 
follows by Theorem 15.2.10. ■ 

Proceeding as in the proof of Theorem 15.2.10, we can prove a similar result 
for any finite set of elements ai,a 2 ,..., a n (not all zero) of a principal ideal 
ring. 

Let R be an integral domain and aj, < 22 ,..., a n E R, not all zero. Suppose 
that a gcd of ai,a 2 ,..., a n exists. Let d and d' be two greatest common 
divisors of a\, < 22 ,..., a n . Then d\d' and d'\d. We ask the reader to verify in 
Exercise 6 (page 359) that d and d' are associates. If d is a gcd of ai, < 22 ,..., 
a n , then any associate of d is also a gcd of a 1; < 22 ,..., a n . Considering this, we 
can say that the gcd of a\ , a 2 ,..., a n is unique in the sense that if d and d' 
are greatest common divisors of < 21 , 02 , ..., a n , then d and d' are associates. 
Hence, from now on, the gcd of aq, 02 ,..., a n is denoted by gcd(ai, 02 ,..., a n ). 
This outcome motivates the definition of associates. We will further motivate 
this concept when we examine unique factorization in integral domains. 

In a Euclidean domain (E,+,*,u), we have seen that the gcd (a,b) of two 
elements a,b £ E (a,b not both zero) exists in E. Next we give an algorithm 
similar to the algorithm of finding the gcd of two integers given in Chapter 1. 

Let a,b £ E with 6^0. 

Step 1: Find q\ and r\ in E such that a = qib + r\, where r\ — 0 or 
u(ri) < v(b). If r\ — 0, then b\a and so gcd(a, b) = b. If rq / 0, then gcd(a, b ) = 
gcd(6,7~i). Thus, we need to find gcd(6,rq). 

Step 2: Find <72 and r 2 in E such that b = q\r\ -f 7 - 2 ,.where r 2 = 0 or 
v(r 2 ) < u(?q). If r 2 = 0, then gcd(a,6) = gcd(6, rq) = r \. If V 2 ^ 0, then 
proceed to find gcd(r 1 ,r , 2 ). Since v(b) > v(ri) > v(r 2 ) > • • • is a strictly 
descending chain of nonnegative integers, the above process must stop after a 
finite number of steps. Therefore, there exists a positive integer n such that in 
the nth step there exist elements q n and r n in E such that r n _2 = q n r n -\ +r n , 
where r n — 0. Thus, 


= gcd(6, r : ) 

(a = q\b + r 1} u(r x ) < v(b)) 

= gcd(r 1} r 2 ) 

(b = q 2 r x + r 2 , v(r 2 ) < vfa)) 

= gcd(r 2 ,r 3 ) 

(n = 93^2 + r 2 >i v(rz) < v(r 2 )) 

= gcd(r n _ 2 , r n -i) 

{r n -3 = gn-l^n -2 + ^n- 1 , 


v(r n - 1 ) < v{r n - 2 

= gcd(r n _i, r n ) 

(r n _ 2 = q n r n - 1 + r n , r n = 0). 




gcd(a, b) 
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Next we find x, y in E such that gcd(a, b) — ax + by. 


r n—l — ^n— 3 Qn—l'^'n —2 

= T'n—Z ~ Qn — li^n —4 — Qn~ 2 r n— 3) 

= ^71—3(1 d" ( Qn — 1 )( Qn— 2)) T T n — 4 ( Qn— l) 


= by + ax. 


15.2.1 Worked-Out Exercises 

0 Exercise 1 Let £ be a Euclidean domain. Let a,b,q,r E E be such that 
b ^ 0, a = qb + r, and r ^ 0. Show that gcd(a, 6) = gcd(6, r). 

Solution: Let gcd(a.6) = d and gcd(6, r) = d'. Now d|a and d\b. Thus, 
r = a — qb implies that d|r. Hence, we find that d is a common divisor of b and 
r and so d'\d. Now d'\b and d'\r and so a = qb + r implies that d'\a. Therefore, 
d' is a common divisor of a and b and so d\d'. By Theorem 15.2.5(iii), it follows 
that d and d' are associates and so gcd(a,6) = gcd(6,r). 

Exercise 2 Let a, 6, and c be three nonzero elements of a PID R. Show that 
there exist x,y E R such that ax + by = c if and only if gcd(a, 6)|c. 


Solution: Let gcd(a,6) = d. Suppose there exist x,y E R such that ax + 
by — c. Since d\a and d|fr, we find that d\c. Conversely, suppose that gcd(a, 6)|c. 
Then c = dd' for some d' E R. Now there exist x\ y' E R such that d = ax' + by'. 
Then ax'd' + by'd' = dd' = c. Let x = x'd' and y = y'd' . Then ax by = c. 


0 Exercise 3 In the domain Z[«\/5], prove the following: 

(i) gcd(2,1 + iy/b) = 1, 

(ii) gcd of 6(1 — i\J 5) and 3(1 -f fv / 5)(l — i\J 5) does not exist. 


Solution: (i) In Z[f\/5], the units are 1 and —1. Let a-\-ib\J 5 = gcd(2, 1 + 
i\j 5). Then (a + ibv' 5)|2. Thus, 2 = (a + iby/b) (c + idy/E) for some c-\-id\J 5 E 


Z[i\/5]. 

This implies that 



4 = (a 2 + 5 b 2 )(c 2 + 5d 2 ). 


Hence, 

a 2 + 56 2 - 2, c 2 + 5d 2 = 2 

(15.1) 

or 

a 2 + 5 b 2 = 4, c 2 + 5 d 2 = 1 

(15.2) 

or 

a 2 + 56 2 = 1, c 2 + 5 d 2 - 4. 

(15.3) 
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Now Eqs. (15.1) cannot hold for any c,d G Z. The only integral solutions 
of a 2 + 56 2 = 4 are a = ±2 and 6 = 0 and the only integral solutions of 
a 2 + 56 2 = 1 are a = ±1 and 6 = 0. Thus, from Eqs. (15.2) and Eqs. (15.3) 
we find that gcd( 2,1 + iy/ 5) = 1 or 2. If gcd(2,1 + iy/b) = 2 , then 2|(1 + iy/ 5). 
Hence, 1 + iy/b = 2{p + iqy/ 5) for some p + iqy/ 5 G Z[z\/5]. This implies that 
2p = 1 = 2g. But there do not exist integers p and q such that 2p = 1 = 2q. 
Therefore, gcd(2,1 + iy/ 5) = 1. 

(ii) Suppose gcd( 6 (l — z\/5),3(l + iy/ 5)(1 — iy/ 5)) exists. Then gcd( 6 (l — 
iy/ 5), 3(1 + iy/ 5)(1 - i>/5)) = 3(1 - iy/b) gcd(2,1 + iy/b) = 3(1 - iy/b). Now 
(1 + iy/ 5)(1 — iy/ 5) = 6 . Hence, 6 is a common divisor of 6(1 — iy/ 5) and 
3(l + z\/5)(l —iy/ 5). Consequently, 6|3(1 — iy/b). This implies that 21 (1 — iy/b), 
which is not true in Z[iy/b\. Therefore, gcd( 6 (l — z\/5),3(l + iy/b)(l — iy/b)) 
does not exist. 


<0 Exercise 4 In Z [z], find gcd(9 — 5z, —9 + 13z). 


Solution: By Theorem 15.1.7, Z[z] is a Euclidean domain, where the 
valuation is defined by N(a + hi) = a 2 + 6 2 . Now N (9 — bi) — 106 and 
iV(—9 + 13z) = 250. 

Ci.. n . —9+132 _ (-9+130(9+50 _ —81—452+1172—65 _ -146+722 __ -146 , 

oiep J.. 9 _ 5i — 106 — 1Q6 — 1Q6 - inf; T 

Thus, —9 + 13 i = (—1 + z)(9 — 5z) — 

360+306^200^+170 = (_]_ + *)(9 _ 5 i) - 

Note that N {—5 — i) < N (9 — 5z). 


106 
40+34z 


106 


106 
40+34i 


530+1 
106 


(9 — 5i) = (—1 -h i)(9 — 5i) - 
= (-l + i)(9-5z) + (-5-*)- 


Step 2: ^ = 


9-5i _ 9— 5i -5+2 _ -45 +92 +252+5 _ -40+34i _ -20+17i _ -20 


-5 — 2 —5+2 


26 


-7+42 


26 


13 


13 


+ 


13* ~ ( _1 “ 13 ) + (1 + T 3 )* = (“1 + *) + 13 * 

Thus, 9 — bi = (—1 + z)(—5 — i) 4- ~ 7 1 ^ 4t (—5 — i) = (—1 + i)(—5 — i) + 

35 - +7^02 ±4 = (_ 1 + i)(_5_i)+ 39^31 = (-i + i )(_ 5 - i) + (3 _ {). Note that 

N{3 — i)< N(—b — i). 

oi pll q. —5—2 —5—2 3+2 —15—52—32+1 —14—82 —7—42 ^7 4 i 

Otep o. 3 _. — 3_ z 3+i — 10 — 10 “ 5 — 5 5 — 

(-l-i)-(l-l)i=(-l-0 + 


- 2+2 
5 

- 2+2 . 


Thus, -b-i = (—1 — z)(3 —z) + “5 (3 —i) = (-l-i)(3-z) + 


(-l-z)(3-z) + 


—5+52 _ 


—6+22+32+1 _ 

5 


= (—1—z)(3—z) + (—1+z). Note that N( — l+i) < N(3—i). 


Step 4: ^ = 

Thus, 3 — i — (—2 + z )(—1 + z) + 0. 
Hence, gcd(9 — 5z, —9 + 13z) = — 1 + i. 


_ 3—2 —1—2 —3—32+2 — 1 —4 — 2i O I • 

1+2 “ (- 1+0 (- 1-0 ~ 2 — 2 — ^ "T 


<0 Exercise 5 In Z[x], find two polynomials f(x) and g(x) such that gcd(/(x), 
g{x )) = 1 , but there do not exist f\{x) and gi(x) in Z[x] such that 
1 = fi x )fi( x )+ g{x)g\(x). 
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Solution: x + 6 and x -f- 4 are elements of Z[x}. The gcd(x + 6 , x + 4) = 1. 
Suppose there exist fi(x) and g\{x) in Z[x\ such that 

1 = (x + 6 )/i(x) + (x + 4)pi(:r). (15.4) 

The constant term of the right-hand side in Eq. (15.4) is an even integer, 
whereas in the left-hand side, the constant term is 1, a contradiction. Hence, 
there do not exist fi(x) and gi(x) in Z[x] such that 1 = (x + 6)fi(x ) + (x + 

4)pi 0*0 ■ 

Exercise 6 Let R be a commutative ring with 1 and S denote the set of all 
infinite sequences {a n } of elements from R. Define + and • on S by 

{ a n} T {5n} — { a n T and 

{ a n} ' {^n} — 

where 


c n = ao b n + ai& n _i +-b a n bo for all n = 0 , 1 , 2 ,- 

Show that 

(i) S' is a commutative ring with 1 ; 

(ii) an element {a n } is a unit if and only if ao is a unit in R; 
(hi) if R is a field, then S' is a PID. 


Solution: (i) It is easy to verify that S' is a commutative ring with 1. The 
sequence {1,0,0,...} is the identity element of S. 

(ii) Let {a n } G S. Suppose {a n } is a unit. Then there exists a sequence 
{ 6 n } such that {a„}{ 6 „} = 1 . Hence, ao^o = 1 and so ao is a unit. Conversely, 
suppose that ao is a unit. We now consider the sequence { b n }, where bo = a,Q l . 
b\ ag (aia 0 ), ■ •., bk ag (aj&k—^ -j- * ■ ■ T Qfc^o), k 2. Now ag 6 g — 1, 
ao&i + Q\bo = ao(—a 0 1 (aia 0 1 ))+ aia 0 1 = 0 , ..., a^bo + dk- 1^1 + ■ ■ • + 0 - 0 bk = 

a k bo+a k -ibi-\ -b a 0 (-ag 1 (a 1 5 fc _H-ba fc 5 0 )) = 0. Therefore, {a n }{6 n } = 1. 

proving that {a n } is a unit. 

(iii) Suppose R is a field. Let I be an ideal of S. If I = {0}, then I is 
a principal ideal. Suppose I 7 ^ {0}. Let {a n } be a nonzero element of I. We 
define the order of a nonzero sequence {a n } as the first nonnegative integer n 
such that a n 7 ^ 0, i.e., n is a nonnegative integer such that a n 7 ^ 0 and a; = 0 
for i < n. There exists a sequence {a n } such that order of {a n } < order of 
{b n } for all {&„} G I. Suppose order of {a n } = k. Let {c n } be a sequence such 
that Ci = a,k+i for all i > 0. Then (c n } _1 exists and {c n }~ 1 {a n } = {d n } G I. 
Also, dk = 1 and d{ = 0 for all i =4 k. We now show that I = ({d n }). Clearly 
{{d n }) Q I • Suppose {a n } G I. Let the order of {u n } be m. Then m > k. Let 
{r n } G S be such that r m -k+i — u m+i for all 2 > 0 and 77 = 0 for all i < m, — k. 
It is easy to verify that {u n } = {r n }{d n } G ({d^}). Hence, I = {{d n }}. 
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15.2.2 Exercises 

1. Find all associates of (i) 3 — 2i in Z[i], (ii) 1 + iV 5 in Z[i\/5], (iii) [6] in 
Zio, (iv) [4] in Z 5 , and (v) [2] + x in Z 3 [x], 

2 . Find all the units of the integral domain Z[i\/3]- 

3. Find all the associates of 2 4- x — 3x 2 in Z[x\. 

4. Show that [4] and [6] are associates in Zio- 

5. Find all units of the polynomial ring Zj[x\. Find all associates of x 2 + [2] 
in Zj[x ]. 

6 . Let R be an integral domain and a 2 ,..., a n (n > 2) be elements 
of R not all zero. If d\ and d ,2 are two greatest common divisors of 
ai, 02 , -. •, Q, n , prove that d\ and cfo are associates. 

7. Let (E, +, - t v) be a Euclidean domain. Let a,b E E be such that a and 
b are associates. Prove that v(a) — v(b). 

8 . Let (E, +, -,v) be a Euclidean domain and a, b € E. If o|6 and v(a) = v(b), 
prove that a and b are associates. 

9. Let (E, 4-, -,v) be a Euclidean domain and a and b be nonzero elements 
of E. Prove that v{ab ) > v(a) if and only if b is not a unit. 

10. Let E be a Euclidean domain. Let a, a' , 6, b', d be nonzero elements of E 
such that a — a'd and b = b'd. Prove that gcd(a / ,6 / ) — 1 if and only if 
gcd(a, b ) — d. 

11. In a PID R, prove that the congruence ax = 6(mod c), where a,b,c are 
nonzero elements of R has a solution in R if and only if gcd(a, c)|6. (Here 
ax = 6(mod c) means ax — b — cr for some r € R.) 

12. Let R be an integral domain. Let a, 6, and c be nonzero elements of R such 
that gcd(a,6) and gcd(ca,c6) exist. Prove that gcd(ca,c5) = cgcd(a,6). 

13. In Z[i], find gcd(2 — 7i,2 + lli). Also, find x and y in Z[i\ such that 

gcd(2 — 7i, 2 + lli) = x(2 — 7i) + y{ 2 + Hi). 

14. Let R be an integral domain and ai,a, 2 ,... ,a n (n > 2) be nonzero el¬ 
ements of R. An element d £ R is called a least common multiple 

(lcm) of ai, a. 2 ,..., a n if 

(i) ai\d, i = 1, 2,... n and 

(ii) if c E R is such that a^Jc, i = 1,2 ,... n, then d\c. 

Prove the following in R. 
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(i) If d\ and are two least common multiples of ai, a 2 ,..., a n , then d\ 
and c ?2 are associates. 

(ii) If d is a least common multiple of < 21 , 02 ,... , a n , then rd is a least 
common multiple of ra\, ra^,..., ra n , for all r E R, r 7 ^ 0. 

15. Let I be the set of all nonunits of Z [i] . Is I an ideal of Z[i]? Show that 
for any nontrivial ideal P of Z[i], the quotient ring Z [i]fP is a finite ring. 

16. Show that Z[\/2] has no unit between 1 and 1 + \/ 2 . 

17. In the domain Z[\/2], prove that an element a + by/2 ^ ±1 is a unit if and 
only if a + bs/2 = (1 + y/2) k or a + b\/2 — — (1 + y/2) k for some positive 
integer k. 

18. An integral domain R is said to satisfy the gcd property if every finite 
nonempty subset of R has a gcd. Prove that every PID satisfies the gcd 
property. 

19. Prove that the integral domain Z[\/2] satisfies the gcd property, where 
the gcd property is defined in Exercise 18. 

15.3 Prime and Irreducible Elements 

In this section, we introduce the concepts of prime elements and irreducible 
elements in a commutative ring with 1 . We show that in a PID and hence in 
a Euclidean domain these two concepts coincide. 

Definition 15.3.1 Let R be a com,mutative ring with 1. 

(i) An element p of R is called irreducible if p is nonzero a,nd a, nonunit, 
and p = ab with a,b £ R implies that either a or b is a unit. An element p of 
R is called reducible if p is not irreducible. 

(H) An element p of R is called prime if p is nonzero and a, nonunit, and 
if whenever p\ab, a,b G R, then either p divides a or p divides b. 

(in) Two elements a and b of R are called relatively prime if their only 
comm,on divisors are units. 

Remark 15.3.2 Let p € Z. If p is an ordinary prime, then both p and —p are 
irreducible and, prim.e in the sense of Definition 15.3.1. 

From the definition of an irreducible element, it follows that the only divi¬ 
sors of an irreducible element p are the associates of p and the unit elements 
of R. The converse of this result does not always hold in a commutative ring 
with 1 . 
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Example 15.3.3 The ring Zq is a, commutative ring with 1. In this ring, the 
unit el.em.ents are [ 1 ] and [5]. Since [3] = [3] [3] a,nd [3] is not a unit it follows 
that [3] is not irreducible. But [3] is an associate of [3]. Also, in Zq, it can 
be verified that [3] is divisible only by associates and the units of Zq. Next, we 
show that [3] is a prime elem.ent in Zq. Let [a], [ 6 ] £ Zq and [3]|[a][6]. Then there 
exists [c] 6 Zq such that [a] [ 6 ] = [3][c], i.e., [ ab} = [3c]. From, this, it follows 
that 6|(a6 — 3c). This im.pli.es that 31 (a .6 — 3c). Since 3|3c, we must have 3| ab. 
Since 3 is prime in Z, 3|a or 3| b. Thus, either [3]|[a] or [3] | [ 6 ]. Hence, [3] is a 
prim.e el.em.ent in Zq. 

Theorem 15.3.4 Let R be an integral dom.ain and, p £ R be such that p is 
nonzero and a nonunit. Then p is irreducible if and only if the only divisors of 
p are the associates of p and the unit elements of R. 

Proof. Suppose the only divisors of p are the associates of p and the unit 
elements of R. Let p = ab for some a,b £ R. Suppose a is not a unit. Then a is 
an associate of p. Therefore, a = pu for some unit u £ R. Now p = pub. Since 
R is an integral domain, it follows that ub = 1. Hence, 6 is a unit and so p is 
irreducible. We leave the converse as an exercise. ■ 


We now consider several examples of prime elements and irreducible ele¬ 
ments. 


Example 15.3.5 In Z, 1 and —1 are the only units, and therefore 2 is divisible 
by ±1 and ±2. It follows that 2 is not divisible by any other integer. Therefore, 
2 is an irreducible element. Suppose now 2 |a 6 and 2 does not divide a for som.e 
a,b £ Z. Since 2 does not divide a, a is an odd integer and. so gcd(2,a) = 1. 
Therefore, there exist c, d £ Z such tha,t 1 = 2 c + ad. Thus, b = 2cb + abd. Since 
2| ab and 2|26c, it follows that 2| b. Hence, 2 is prime. 

Example 15.3.6 The polynomial x 2 + 1 is irreducible in R[x], but is reducible 
in C[x]. If x 2 + 1 were reducible in R[x], then there would exist real numbers 
a, b, c, d such that 

x 2 + 1 = (ax + b)(cx + d) — acx 2 + (ad + bc)x + bd. 

Then ac = 1 = bd and ad + bc — 0. Thus, 1 = ( ac)(bd ) = (ad)(be) = (ad)(—ad). 
Hence, 1 — —(ad) 2 , which is impossible in R. However, x 2 + 1 = (x + i)(x — i) 
in C[x]. 

Example 15.3.7 The polynomial x 2 — 2 is irreducible in Q[x] and reducible 
in R[x]. If x 2 — 2 were reducible in Q[m] , then there would exist a,b,c,d £ Q 
such that 


x 2 — 2 = (ax T 6 )(ex + d) = acx 2 + (ad + bc)x + bd. 

Pufuc. 7feoiAe»tafL<ra/ “Phy-SALc-A- 
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Then ac = 1 ,ad + bc = 0, and bd = —2. Thus, ( ad ) 2 = ( ad)(ad ) = —( ad)(be ) = 
(ac)(— 6 c?) = 2. This implies that y/2 = ad E Q. TTus is a contradiction since 
V2 i Q- Therefore, x 2 — 2 is irreducible in Q[z]. However, x 2 — 2 — (z — 
V^Xz + v^) in R[z]. 

Example 15.3.8 The polynomial ax b is irreducible in F[x], where F is 
a field and a 0. Suppose ax + b = f(x)g(x). Then deg (f(x)g(x)) = 1 = 
deg/(z)+deg < 5 r(z). We m,ay assum,e that deg/(z) = 0 and deg g{x) = 1 . Since 
deg/(z) = 0, /(z) is a nonzero constant polynomial and thus a unit. Hence, 
ax + b is irreducible. 

Example 15.3.9 Consider the polynomial ring Z[z ,y]. Then x and y are ir¬ 
reducible. 2 z is not prime since 2 z| 2 z, but 2x does not divide 2 a,nd 2 z does 
not divide x. Also, 2x is reducible, x 2 and, y 2 a,re relatively prime, but neither 
is irreducible nor prime. 

Theorem 15.3.10 Let R be an integral dom.ain and p be a, prim,e elem,ent in 
R. Then p is irreducible. 

Proof. Suppose p = be for some b,c € R. To show p is irreducible, we must 
show that either 6 is a unit or c is a unit. Now p = be implies that p\b.c. Since 
p is prime, p\b or p\c. If p\b, then b = pq for some q E R. Thus, p = be — pqc 
and so p{ 1 — qc ) = 0. Since R is an integral domain and p ^ 0, p{ 1 — qc ) = 0 
and so 1 — qc — 0. Thus, qc = 1, which implies that c is a unit. Similarly, if 
p|c, then b is a unit. Hence, p is irreducible. ■ 

The following example shows that the converse of Theorem 15.3.10 is not 
true. 

Example 15.3.11 Consider the integral domain 

Z[i\/5] — {a + biVb [ a, b £ Zj. 

Let us show that 3 = 3 + 0iy/b G Z[i^/b} is irreducible, but not prim,e. Suppose 
3 = (a + 6f\/5)(c + di\/5) in Z[i\/5]- Then 3 = 3 = (a — biy/5)(c — diy/fi). Hence. 
9 = (a 2 +56 2 )(c 2 +5d 2 ). Since a, 6 , c, d are integers, the previous equality implies 
that 

a 2 + 5 b 2 = 3 and c 2 + 5 d 2 = 3 (15.5 

or 

a 2 + 5 b 2 = 1 a,nd c 2 + 5 d 2 ~ 9 (15.6 

or 

a 2 + 5 b 2 = 9 and c 2 + 5 d 2 = 1. (15.7 

TfeaiAewtafLca/ 
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Clearly there do not exist integers a,b,c,d satisfying Eqs. (15.5). The first 
equation of Eqs. (15.6) implies that b = 0 and a = ±1. Thus, it follows that 
a + bi\/E is a unit. Similarly, the second equation of Eqs. (15.7) implies that 
c+diyf 5 is a unit. Hence, 3 is irreducible. Now 3|6 and 6 = (l+i\/5)(l — iy/5). 
Suppose 3|(l+i\/5). Then l+f\/5 = 3(a+6i\/5) for som,e a,b GZ. This implies 
that 3a = 1, a contradiction, since the equation 3a = 1 has no solution in Z. 
Hence, 3 does not divide (1 + i\/ 5). Similarly, 3 does not divide (1 — i\J 5). 
Thus, 3 is not prime. 

The following theorem show that the converse of Theorem 15.3.10 holds in 
a principal ideal ring. 

Theorem 15.3.12 Let R be a principal ideal ring andp E R. Ifp is irreducible, 
then p is prime. 

Proof. Suppose p divides ab , where a,b E R. Then there exists r E R such 
that pr = ab. Now ( p , b) = (d) for some d E R. Therefore, there exists q E R 
such that p = dq. Since p is irreducible, either d or q must be a unit. If d is a 
unit, then (p, b) = (d) = R. Hence, 1 = sp + tb for some s,t E R. Therefore, 
a = asp + atb = asp + tpr = (as + tr)p. This implies that p divides a. If, on the 
other hand, q is a unit, then d = pq~ l E (p) . Thus, (d) C (p) C ( p , b) = (d) so 
that (p) = (p,b) . Hence, b E (p) and so p divides b. M 

Corollary 15.3.13 Let R be a principal ideal domain and p E R. Then p is 
irreducible if and only if p is prim,e. 

Proof. The result follows by Theorems 15.3.10 and 15.3.12. ■ 

Corollary 15.3.14 Let R be a Euclidean dom,ain a,nd p E R. Then p is irre¬ 
ducible if and only if p is prime. 

Proof. Since every Euclidean domain is a principal domain, the result 
follows from Corollary 15.3.13. ■ 

Theorem 15.3.15 Let R be a principal ideal ring and a, b E R. If a and b are 
relatively prime, then there exist s, t E R such that 1 — sa + tb. 

Proof. Since the common divisors are units, 1 is a gcd of a and b. The 
desired result follows from Theorem 15.2.10. ■ 

We conclude this section by proving the following theorem, which charac¬ 
terizes irreducible polynomials over a field. 
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Theorem 15.3.16 Consider the polynomial ring F[x] over the field F and 
p(x) G -F[x]. Then the following conditions are equivalent. 

(i) p{x) is irreducible. 

(ii) F[x\/ (p(x)) is an integral domain. 

(Hi) F[x]/ (p(x)) is a, f\.eld. 


Proof. (i)=>(iii). Let f{x) G F[x\/ (p(x)) be such that f{x) ^ 0, where 
fix) denotes the coset f(x) + (p(x)) . Now up(x ) and u, where u G .F\{0}, 
are the only elements of F[x] which divide p(x). Since f{x) £ (p(x)) , f{x) 
and p(x) are relatively prime and so there exist s(x), t(x) G F[x] such that 
1 = s(x)f(x) + t{x)p(x). Thus 

I = s(x)f{x) + t(x)p(x) (in F[x}/(p(x ))) 

and so 1 = s(x) f(x). Hence, f(x) has an inverse, namely, s(:r), and so 
F[x]/ (p{x)) is a field. 

(iii)=t>(ii): Immediate. 

(ii) =>-(i): If p(x) is a unit, then (p(x)) = F[x\ and so F[x)f (p{x)) = {0}, a 
contradiction to the hypothesis that F[x}/ (p(x)) is an integral domain. There¬ 
fore, p(x) is not a unit. Suppose p(x) = f(x)g(x). Then 0 — p(x) — f{x)g[x) = 
f{x) g{x). Therefore, f{x) = 0 or g(x) — 0. This implies that f{x) G (p(x)) or 
g{x) G (p(x)) , say, f(x) G (p(x)) . Thus, f(x) = q{x)p{x) for some q(x) G F[x]. 
Hence, p{x) — q(x)p(x)g(x) and so by a degree argument q(x), g(x) G .F\{0} 
are units. Thus, the only factorization of p(x) is u~ 1 (up(x)), where u is a unit 
in F[x}. Consequently, p{x) is irreducible. ■ 

15.3.1 Worked-Out Exercises 

0 Exercise 1 Show that [2] is a prime element in Zio, but [2] is not irreducible 
in Z 10 . 

Solution: In Zio, [1], [3], [7], and [9] are the only units. Now [2] = [2] • [6]. 
Since neither [2] nor [6] is a unit, [2] is reducible. Suppose [2]|[a][6]. Then [2]|[a6]. 
Therefore, [ab] = [fc][2] for some [fc] G Zio- This implies that ab — 2k — lOr for 
some r G Z, i.e., ab = 2k F 10 r = 2 (k + 5 r). Therefore, 2|a6. Since 2 is prime 
in Z, 2]a or 2| b. Hence, [2]|[a] or [2]|[6]. Thus, [2] is prime. Note that Zio is not 
an integral domain. 

0 Exercise 2 Let R be an integral domain such that any two elements a, b G 
R , not both zero, have a gcd d expressible in the form d — ra+tb , r, t G R. 
Let p G R. Show that p is prime if and only if p is irreducible. 

Solution: Every prime element in an integral domain is irreducible by 
Theorem 15.3.10. Let us prove the converse. Suppose p is irreducible. Let 
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p\ab, a,b G R. Now gcd(p, a) exists in R. Let d = gcd(p, a). Since d\p and p is 
irreducible, it follows that either d is an associate of p or d is a unit. Suppose 
d is an associate of p. Then p\d. This implies that p|a, since d\a. Suppose d is a 
unit. Since 1 is an associate of d, 1 = gcd(p, a). Thus, there exist s,t G R such 
that 1 = ps + at. This implies that b = psb + abt. Now p\psb and p\abt. Hence, 
p\b. 

<0> Exercise 3 Let n be a square free integer (an integer different from 0 and 1, 
which is not divisible by the square of any integer). Let Z [y/n\ = {a+by/n 
| a, b G Z}. Define a function N : Z[-\/n] —► Z by 

TV (a + by/n) = (a + by/n) {a — by/n) = a 2 — nb 2 . 

Show that if N(x) is a prime integer, then x is irreducible for all x G 

z[vH 

Solution: Suppose N(x ) = p, where p is a prime integer. Suppose x — 
(i a + by/n)(c+dy/n ). Now p = N(a + by/n)N(c + dy/ri) — (a 2 —nb 2 ){c 2 —nd 2 ) by 
Worked-Out Exercise 2 (page 349). Hence, either ( a 2 ~nb 2 ) = ±1 or (c 2 — n d 2 ) = 
±1, i.e., either a + by/n is a unit or c + dyfn is a unit. Thus, x is irreducible. 

15.3.2 Exercises 

1. Show that in the integral domain Z[i\/5], 2-\-iy/b is an irreducible element, 
but not a prime element. 

2. Show that 2 — i, 1 + i, and 11 are irreducible elements in Z[i]. 

3. In Z[i\/5], show that 3 is not a prime element. 

4. In Z 12 , show that [3] is a prime element, but is not irreducible. 

5. Is the polynomial x 2 + [1] irreducible in Z 2 [^c]? 

6. Let T be the set of all sequences {a n } of elements of Z. Prove the follow¬ 
ing. 

(i) T is an integral domain with respect to addition and multiplication 
defined by for all {a n }, {6 n } £ T, 

T = "b 

{fl n } • {frn} = {Cn}, wher e C n = J2i=0 a i b n-i- 

(ii) To = {{a„} G T \ ai = 0 for all but a finite number of indices} is a 
subring with identity. 

(iii) The element (1,1,0,...) is a unit in T, but not in Tq. 

(iv) (2, 3,1, 0, 0,...) is irreducible in T, but not in To- 
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7. Let R be an integral domain. Show that (i) every associate of an irre¬ 
ducible element in R is irreducible and (ii) every associate of a prime 
element in R is prime. 

8. In Z[£], show that 3 is a prime element, but 5 is not a prime element. 

9. What are the prime elements of Zg? Are they irreducible? 

10. In Z[i], if a 4- bi is an element such that a 2 + 6 2 is a prime integer, then 
show that a + bi is a prime element. 

11. Let a + bis/ 3 6 Z[i\/3]. If a 2 4- 36 2 is a prime integer, show that a 4- bi\J 3 
is an irreducible element in Z[f%/3]- 

12. In the following exercises, write the proof if the statement is true; other¬ 
wise, give a counterexample. 

(i) 13 is an irreducible element in Z[i\. 

(ii) Every prime element of Z is also a prime element of Z [f]. 

(iii) In Zi8, every prime element is an irreducible element. 

(iv) In Z [i], a 4- bi is a prime element if and only if a — bi is a prime 
element. 

(v) In a PID J?, if p and q are two prime elements such that p\q , then p 
and q are associates. 





Chapter 16 


Unique Factorization Domains 

16.1 Unique Factorization Domains 

In this section, we study those integral domains in which an analogue of the 
fundamental theorem of arithmetic holds. 


Definition 16.1.1 A nonzero nonunit element a of an integral domain D is 
said to have a factorization if a can be expressed, as 


a = p lP2 • • -p n , 


where p\,p 2 , ■ • ■,p n are irreducible elements of D. The expression P 1 P 2 • • -p n is 
called a factorization of a. 


An integral domain D is called a factorization domain (FD) if every 
nonzero nonunit element has a factorization. 

In Chapter 15, we saw' that in an integral domain D every nonzero element 
a G D is always divisible by the associates of a and the units of D. These 
are called the trivial factors of a. All other factors (if any) of a are called 
nontrivial. For example, ±2 and ±3 are nontrivial factors of 6 in Z. In 
the following lemma, we show that a nonzero nonunit element that has no 
factorization as a product of irreducible elements can be expressed as a product 
of any number of nontrivial factors. 


Lemma 16.1.2 Let D be an integral dom,ain. Let a be a nonzero nonunit 
element of D such that a does not ha,ve a factorization. Then for every positive 
integer n, there exist nontrivial factors aj, 02 ,..., a n € D of a such that a = 
0 , 1(22 ' ’ ’ O'n- 
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Proof. By the hypothesis, a is not irreducible. Therefore, a = a\b\, where 
ai, bi E D are nontrivial factors of a. At least one of a\ or b\ does not have 
a factorization; otherwise the factorization of a\ and b\ put together produces 
a factorization of a. Suppose a\ does not have a factorization. Then a\ is 
a nonzero nonunit element and a\ is not irreducible. There exist nontrivial 
factors d2? £>2 € D of a\ such that a\ = d2&2- Then a = 02^2^1- Now at least one 
of <22 or 62 does not have a factorization. If 02 does not have a factorization, we 
repeat the above process with 0.2. Proceeding this way, we can find nontrivial 
factors <21, a.2,..., a n E D of a such that a = a\a 2 ■ ■ ■ a n - ® 

Theorem 16.1.3 Let D be an integral domain with a function N : D\{0} —» 

Z* such that for all a,b E D\{0}, N(ab) > N(b), where equality holds if and 
only if a is a unit. Then D is a FD. 

Proof. Suppose D contains a nonzero nonunit element a such that a does 
not have a factorization. Now N(a) € Z#. Let N(a) — n. By Lemma 16.1.2, a 
can be expressed as a product of n + 2 nontrivial factors ai,<22,..., a n+ 2 G D. 

Then a — a\a 2 • • • a n + 2 &nd 

n = N(a) 

> N(a2 ■ • ■ a n + 2) (since 01 is not a unit) 

> N{a^ • • • a n+ 2) 

> N(a 4 ■ • • a n+2 ) 

> N(a n ±\i 2 Tl _|-2) 

> N(a n+ 2 ). 

This shows that there exist at least n + 1 distinct nonnegative integers strictly 
less than n, a contradiction. Thus, D is a FD. ■ 

Example 16.1.4 Consider the integral domain Z[i\. Define 

N : Z[z]\{0} Z # 

by N(a + bi) = a 2 -\-b 2 for all a-\-bi E Z[i]. It is easy to verify that a + bi is a unit 
if and only if N(a + bi) = 1. Let a + bi , c + di be two nonzero elements of Z[i]. 

Then N((a-\-bi)(c-\- di )) = N((ac — bd) + (ad + bc)i) = (ac — bd) 2 + (ad + 6c) 2 = 

(a 2 + b 2 )(c 2 + d 2 ) > (c 2 + d 2 ) = N(c + di), where the equality holds if a.nd only 
if N(a + bi) is a unit. Hence, Z[z] is a FD. 

Definition 16.1.5 An integral dom.ain D is said to satisfy the ascending 
chain condition for principal ideals (ACCP), if for each sequence of 
principal ideals, {<21), (02) , (<23) ,... such that 

(<>l) £ <<*2> C(a 3 ) C 

TfeaiAewtaftea/ r4y.Ai.cJ. 
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there exists a positive integer n (depending on the sequence) such that (a n ) = 
(a t ) for all t > n. 

Lemma 16.1.6 Every principal ideal domain D satisfies the ACCP. 

Proof. Let (a\) C ( 02 ) C ( 03 ) C • •• be a chain of principal ideals in D. 
It can be easily verified that I = U^n ( a i) is an ideal of D. Since D is a 
PID, there exists an element a E D such that I = (a). Hence, a E { a n ) for 
some positive integer n. Then / C (a n ) C I. Therefore, I = (a n ) . For t > n, 
(at) Q I = (a n ) C (at) • Thus, (a n ) = (a t ) for all t > n. ■ 

Theorem 16.1.7 An integral dom,ain D with the ACCP is a FD. 

Proof. Suppose D is not a FD. Then there exists a nonzero nonunit element 
a such that a does not have a factorization. Thus, a is not irreducible and so 
a = ai 6 i, where a\,bi E D are nontrivial factors of a. At least one of ai 
or 61 must not have a factorization, otherwise the factorization of a\ and b\ 
put together will produce a factorization of a. Suppose a 1 does not have a 
factorization. Now a and a\ are not associates. Therefore, (a) C (a\) . Since a\ 
does not have a factorization, we can express a\ = (Z 262 , where 02,62 6 D are 
nontrivial factors of ai- At least one of <22 or 62 does not have a factorization. 
Suppose 02 does not have a factorization. Then (o) C ( 01 ) C ( 02 ). We now 
repeat the above process with 02 - Thus, we find that there exists an infinite 
strictly ascending chain of principal ideals in D , a contradiction. Hence, D is 
a FD. ■ 

Corollary 16.1.8 Every PID is a FD. 

Proof. The proof is immediate by Lemma 16.1.6 and Theorem 16.1.7. ■ 

Definition 16.1.9 An integral dom,ain D is called a unique factorization 
domain (UFD) if the following two conditions hold in D : 

(i) every nonzero nonunit elem.ent, of D can be expressed as 

a = V 1 V 2 ■ • ■ Pn , 

where pj, P 2 ,.. . ,p n are irreducible elements of D and 

(ii) if a = P 1 P 2 • • - pn — qiq2 ‘ ‘' qm are two factorizations of a as a finite 
product of irreducible elem.ents of D, then n = m and there is a, permutation <7 
of { 1 , 2, ..., n} such that pi and q c (j) are associates for all i = 1 , 2 ,..., n. 

From the above definition, it follows that an integral domain D is a UFD 
if and only if D is a FD and every nonzero nonunit element of D is uniquely 
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expressible (apart from unit factors and order of the factors) as a finite product 
of irreducible elements. 

Let us first prove the following interesting property of a UFD. 

Theorem 16.1.10 In a unique factorization domain, every irreducible ele¬ 
ment is prime. 

Proof. Let D be a UFD. Let p be an irreducible element of D and p\ab in D , 
where a, b G D. If a = 0, then p divides a, and if b = 0, then p divides b. If a is a 
unit, then p divides b , and if 6 is a unit, then p divides a. We now assume that a 
and b are nonzero and nonunits. Now ab — pc for some ceD. Let d — pc = ab. 
Since neither a nor b is a unit, it follows that d is not a unit. If c is a unit, 
then d is irreducible and so either a or 6 must be a unit, a contradiction. 
Therefore, c is not a unit. Since D is a UFD, there exist irreducible elements 
ci, C 2 ,..., c n , ai, a 2 , • - •, a mj and b\, 62 ,..., b T in D such that c — c\C 2 • • • c n , a — 
a\a 2 • • • Q m , and b — & 1&2 ■ ■ ■ b T . Hence, d = pc\C 2 ■ • ■ Cn = a 102 * • • a Tn b\b 2 ■ • ■ b r 
are two expressions of d as a finite product of irreducible elements. Since D is 
UFD, p must be an associate of one of the irreducible elements < 21 , 02 ,..., a m , 
&i, & 2 > ■ ■ • > &r- If °ne of ai, 02 , • • ■, is an associate of p, then p|o, and if one of 
61 , 62 ,..., b r is an associate of p, then p\b. Hence, p is prime. ■ 

Example 16.1.11 Consider the integral domain Z[iy/b\ = {a + biy/ 5 | a, 6 G 
Z}. Define 

N : Z[iVb}\{0} -* Z # 

by 

N( a + biy/b) = a 2 + 56 2 . 

We can show that a+bi\/b is a unit if and only if N(a+biy/b) = 1. Let a+biy/b , 
c + diy/b be two nonzero elements ofZ[iy/b]. Then N((a + biy/b)(c + diy/5)) = 
N((ac~5bd)+i{ad+bc)y/E) ~ (ac — 56d) 2 + 5(ad+ 6 c ) 2 = (a 2 + 56 2 )(c 2 + 5d 2 ) > 
(c 2j rbd 2 ) = W((c+<ii\/5)), where equality holds if a,nd, only ?/A r ((a+6i\/5)) — 1 , 
i.e., if and only if a + biy /5 is a. unit. Hence, Z[i^/5] is a FD by Theorem, 
16.1.3. In Example 15.3.11, we showed that 3 is an irreducible element. Now 
3|(2 + i\/ 5)(2 — z\/5). Suppose 3[(2 + i\J 5). Then 2 + i\ff> — 3(m + niy/ 5) for 
som,e m + niy/ 5 G Z[iy/E\. This implies 2 = 3m and 1 = 3n, which is impossible 
for integers m, and n. Therefore, 3 / (2 + iy/b). Similarly, 3/(2 — iy/b). Thus, 
3 is not prim,e in Z[iy/b]. Hence, Z [iy/b] is not a UFD by Theorem, 16.1.10. 

In this integral dom,a,in, we can also show that 2, 1 -f iy/b, 1 — iy/b are 
irreducible elem.ev.ts and 2 is not an associate of any one of IF iy/b and, l-iV5. 
Hence, 6 = 2 • 3 = (1 + tv / 5)( 1 - is/ 5) are two factorizations of 6 , but there 
does not exist any correspondence between the irreducible factors such that the 
corresponding elements are associates. 


“Public. 
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Theorem 16.1.12 A factorization dom.ain D is a, UFD if and only if every 
irreducible element of D is a prime element. 

Proof. Suppose the factorization domain D is a UFD. Then by Theorem 
16.1.10, every irreducible element is a prime element. 

Conversely, assume that every irreducible element is a prime element in the 
FD D. Suppose a = P 1 P 2 ■ • • p n — Q\ <72 ■ ■ • q m are two factorizations of a as a 
finite product of irreducible elements. Then P 1 P 2 ■ "Pn — <7i(<? 2 * • *<?m) implies 
that qi\{p\P 2 • • - Pn)- Since q\ is also prime, at least one of p\, p 2 , ..., p n is 
divisible by q\. Let q\\p\. Now p\ and q\ are both irreducible. Hence, p\ — 
u\q\ for some unit u\. Then u\q\p 2 ■ ■ -p n — 9i?2 • • • <?m, from which it follows 
by the cancelation property that U\p 2 • • -p n = g2 ■ • * g m = <? 2(<?3 • • • q m )• Now 
g 2 |(uiP2 • • -p n )- Since <72 is prime, q 2 does not divide u\. Hence, q 2 divides one 
of p2j • - •, Pn, say, q 2 \p2- Then P 2 = U2<?2 for some unit U 2 and uiU2q2P3 • • -p n = 

<?2 • • • qm- Canceling q 2 from this relation, we obtain U 1 U 2 P 3 ■ ■ -p n — <73 • ■ • q m - 
If n > m, then proceeding this way we find that U\U 2 ■ • • u m Pm +1 • ■ -p n — 1, 
which implies that each of p m + 1 , ..., p n is a unit, a contradiction. If n < m, 
then we find that U 1 U 2 • ■ ■ u n = q n+ \ • • • q m . This implies that each of q n +i, ■ • •, 
q m divides a unit, which is again a contradiction. Thus, n = m. Also, we have 
shown that the corresponding irreducible factors Pi,qi , i = 1,2, ..., n, in the 
factorizations P 1 P 2 • • -p n a nd < 71(72 • • • q n a re associates. Hence, D is a UFD. ■ 

Theorem 16.1.13 Every PID is a, UFD. 

Proof. From Lemma 16.1.6, we find that every PID satisfies ACCP. Hence, 
by Theorem 16.1.7, every PID is a FD. Also, by Theorem 15.3.12, every irre¬ 
ducible element is prime in a PID. Thus, by Theorem 16.1.12, it follows that 
every PID is a UFD. ■ 

By Theorem 15.1.9, every Euclidean domain is a PID and hence by Theorem 
16.1.13, every Euclidean domain is a UFD. This result is one of the important 
results in factorization theory. Let us prove this result independently. First we 
prove the following lemma. 

Lemma 16.1.14 Let E be a Euclidean dom,a,in and a,b £ E. If a\b, b ^ 0, and. 
a is neither a unit nor an associate of 6, then v(a) < v(b). 

Proof. Since a is not an associate of b, it follows that b fa. Hence, a = bq-\-r , 
where r = 0 or v(r) < v(b). Now b = ac for some c G E. This implies that r = 
a — bq = a — acq = a(l —eg). If 1 —eg = 0, then c is a unit and so b is an associate 
of a, a contradiction. Therefore, 1 — eg 0. Thus, v(r) = v(a( 1 — eg)) > v(a ) 
and so v(b) > v(a). ■ 

Theorem 16.1.15 A Euclidean domnin E is a, unique factorization domain. 
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Proof. Let v denote the Euclidean valuation of the Euclidean domain E. By 
induction on 7 ;(a), we first show that every nonzero element a of E is either a 
unit or can be written as a finite product of irreducible elements. If v(a) = v(l), 
then a is a unit. Assume that every nonzero element b E E is either a unit 
or expressible as a finite product of irreducible elements if v(b) < v(a), where 
v(a ) > 'u(l) (the induction hypothesis). If a is irreducible, there is nothing to 
prove. Suppose that a is not irreducible. Then a — be, where neither b nor c is 
a unit. Suppose b is an associate of a. Then b = au for some unit u E E. Thus, 
a — be — auc and so 1 = uc , i.e., c is a unit, a contradiction. Therefore, b is not 
an associate of a. Similarly, c is not an associate of a. By Lemma 16.1.14, it now 
follows that v(b) < v(a) and v(c ) < t»(a). Thus, by our induction hypothesis, 
b and c are expressible as a finite product of irreducible elements of E. Hence, 
so is a. 

The uniqueness of the factorization follows as in Theorem 16.1.12 ■ 

From Theorem 15.1.9, we know that every Euclidean domain is a principal 
ideal domain. We noted in the remark on page 348 that the converse of this 
result is not true. In Theorem 16.1.13, we showed that every principal ideal 
domain is a unique factorization domain. The converse of this result is also not 
true. There is a class of rings for which the converse is true. Call a complex 
number an algebraic integer if it is a root of a monic polynomial p(x) in 
Z[x], The set of all algebraic integers in a finite field extension (Chapter 24) 
of Q is such a ring. However, most of these rings are not unique factorization 
domains. For example, the ring Z[z\/5] in Example 16.1.11 is a ring in which 
there is no unique factorization. Here 6 = (1 — iy/ 5)(1 + iyfh) = 2-3 are 
two factorizations of 6 as a product of two irreducible elements. However, the 
ideal (6) has a unique (up to order) factorization as a product of prime ideals 

(defined in Chapter 17), (6) = ^3,1 + iy/ 5^ ^3,1 — iy/ 5^ ^2,1 + iy/f/j . As a 
matter of fact, the entire class of rings in question has the property that every 
ideal has a unique factorization as a product of prime ideals. 


16.1.1 Worked-Out Exercises 

0 Exercise 1 Show that the integral domain Z[\/lO] = {a + by /10 | a, b £ Z} 
is a FD. 


Solution: Define N : Z[x/l0]\{0} —> Z# by for all a + by/ To E Z[\/T0], 


N(a + by/ To) 



Now N(a + by/ TO) = 1 if and only if \o? — 106 2 1 = 1 if and only if (a + 
by/lO) (a — by/to) — ±1 if and only if a A by/10 is a unit. Let cl + by/ 10, 

c+dy/lQ be two nonzero elements of Z[\/l0l. Then N((aAby/l6)(c+dy/l 6)) = 
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| a 2 — 106 2 1 |c 2 — 10 c/ 2 1 > |c 2 — 10 c/ 2 1 = N((c + d\/l0)), where equality holds if 
and only if lV((a + 6\/i0)) = 1, i.e., if and only if a + by/lQ is a unit. Hence, 
Z[\/5] is a FD by Theorem 16.1.3. 

-0 Exercise 2 Show that in a UFD, every nonzero nonunit has only a finite 
number of nonassociated nontrivial factors. 


Solution: Let D be a UFD. Suppose a is a nonzero nonunit element of D. 
Then a can be expressed uniquely as 

a = p r iP2 Pk k , 

where pi,P 2 , ■ ■ • ,Pk are distinct primes and rq, T 2 , •.., are positive integers. 
Let d = pj-p 2 2 ■ • -p 1 ^i where 0 < U < r*, i = 1,2, ..., k. Then d is a divisor 
of a. Now suppose d is any divisor of a and d is a nonunit. Then d can 
be expressed uniquely as d = q\ l q 1 ^ ■ • • , where <?i, <721 ■ ■ • ? Qm ar e distinct 

primes and t\U2i ■ ■ ■ dm are positive integers. Since d\a, for all i = 1,2, ..., 
m, q^lp^ for some j. 1 < j < k. Then q^ \p T -° and so q t \pj. Therefore, qi 
is an associate of pj. Also, we find that t* < rj. Thus, d is an associate of 

P 1 P 2 * • , 0 < U < Ti , i = 1,2, Consequently, a has only a finite 

number of nonassociated nontrivial divisors. 

0 Exercise 3 Let R — {ao +a\x + • • ■ + a n x n E Q[x] | ao E Z, n E Z*}. Show 
that R is not a UFD. 

Solution: Clearly R is a subring of Q[x] and R contains 1 . Hence, R is an 
integral domain. Now any unit of R is also a unit of Q[a;]. In Q[x], the units 
are the nonzero elements of Q. Since i?D Q — Z, it follows that 1 and —1 are 
the only units of R. For any nonnegative integer n, x E R and t^x is not an 
associate of when n 7 ^ m. Now x — 2 n (^x) shows that ^x is a divisor 
of x. Hence, x has infinite number of nontrivial divisors in R. If R is a UFD, 
then x cannot have an infinite number of nontrivial divisors. Thus, R is not a 
UFD. 


<) Exercise 4 In a UFD, show that the gcd of any two nonzero elements exists. 


Solution: Let R be a UFD and a, b be nonzero elements of R. If one of a 
or & is a unit, then gcd (a, 6 ) = 1. Suppose a and b are nonunits. Then a can 
be expressed uniquely as 

a = P t i l P t 2---P t k> 

where pi,P2, • • • ,Pk are irreducible elements such that pi is not an associate 
of pj when i 7 ^ j and UT 25 ■ ■ ■ Uk are positive integers. Similarly, b can be 
expressed uniquely (up to associates) as 


b = q?q r * 


• ■ • q Tn 
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where <?i, < 72 , • • ■, Qn are irreducible and Tq, ..., r n are positive integers. Now 
if qi is not an associate of any of pi,... ,pfc, then we write a = ■ - • p l £ ■ 

eft. Next if <72 is not an associate of any of pi,P2 ? • ■ • iPki then we write a = 
Pi^P^ '' •Pk k Ql ( &- But, ^ an associate of one of pi,P 2 , ■ ■ ■ ,Pk, then skip <72 
and consider <73. Continue the process for <73,..., q n . We do the same thing for 
b. So we can write 


a 

b 


u 

u 


1 U 2 
h y h ■ 
1 u 2 


■ u 


Tim 

m 


u: 


m 1 


where U\,U 2 ,... ,u m are irreducible elements such that ui is not an associate 
of Uj when i ^ j and n\, ri 2 ,..., n m , I 1 A 2 , • ■ • are nonnegative integers. Let 
d — u^u 1 ^ ■ ■ • it^, where hi = min{n z , k], i = 1,2 ,,m. Then d\a and d\b. 
Let c|a and c|6, c G R. Since any irreducible divisor of c is an associate of one 
of u\, U 2 ,..., u m , it follows that c must be of the form 


c = 





7 


where hi > 0, and hi < hi < h, i = 1,2Thus, h t <k z ,i — 
1,2,..., m. Hence, c|d. Thus, d — gcd(a, b). 


16.1.2 Exercises 

1. Show that Z satisfies the ACCP. 

2. If the integral domain R satisfies the ACCP, prove that the polynomial 
ring R[x\ satisfies the ACCP. 

3. Prove that an integral domain D is a UFD if and only if D satisfies the 
ACCP and every irreducible element is prime in D. 

4. Show that the integral domains Z[i-\/6], Z[i\/7], and Z[£\/l0] are factor¬ 
ization domains, but not unique factorization domains. 

5. Let a,b be two nonzero elements of a UFD D. If gcd(a,6) = 1 and a|c, 
6|c, prove that ab\c in D, where c6D. 

6. For the following statements, write the proof if the statement is true; 
otherwise, give a counterexample. 

(i) Any subring of a UFD with identity is also a UFD. 

(ii) 1 and —1 are the only units of a UFD. 


16.2 Factorization of Polynomials over a UFD 

In this section, we show that every polynomial of degree > 1 over a UFD R 
can be uniquely expressed as a product of irreducible polynomials over R. 
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Definition 16.2.1 Let f(x) = ao + a\x + - - ■ + a n x n be a nonzero polynomial 
in R[x]. Then the gcd{ao, a\ ,... ,a n } is called the content of f(x). 

It is known that the gcd of {ao,a\, ... ,a n } is not unique. If u and v are 
two gcd’s of {ao, ai,..., a n }, then u and v are associates. Hence, if c\ and C2 
are two contents of f(x), then c\ and C 2 are associates and any associate of c\ 
is also a content of f{x). If a and b are two elements of R such that a is an 
associate of b, then we write a ~ b. 

The content of f(x) is denoted by cont/(x). 

Definition 16.2.2 A nonzero polynomial f(x) G R[x] is called, a, primitive 
polynomial if contf(x ) is a, unit. 

Lemma 16.2.3 Let R be a UFD. Let f{x) and, g{x) be two primitive polyno¬ 
mials in R[x}. Then f(x)g(x) is also a primitive polynomial in R[x]. 


Proof. Let f(x) = ao + a\x + • • • + a n x n and g(x ) = &o + bix -4--I- 

6 m x m . Let Cf ~cont f(x) and c g ~cont g(x). Since f(x) and g{x) are primitive, 
cj and c g are unit elements in R. Suppose that f(x)g(x) is not a primitive 
polynomial. Let f{x)g(x) = Co + c\x + • • - + c n + m a: n+7Tl , where co = aobo, 
ci = ao&i + ai6o,..., c* — Yfj=o a jbi-j, where aj = 0 if j > n, and bi-j = 0 
if i — j > m. Now cont f(x)g{x) is not a unit. Let p be a prime element in R 
such that p divides cont / (x )g (x). Then p divides c^ for alii — 0,1,..., n + m. 
Since Cf and c g are unit elements, p does not divide each of ao, ai,..., a n and 
also p does not divide each of 6o, &i,..., b m . Let t be the smallest nonnegative 
integer such that p does not divide a t . Then p divides a^, for i = 0,1,..., t — 1, 
and p does not divide at. Similarly, let r be the smallest nonnegative integer 
such that p does not divide b r . Then p divides bj, for j — 0,1,..., r — 1, and 
p does not divide b r . Therefore, p does not divide atb T . Now Ct+ T — aobt+r+ 

ai6 i+r _i+-b a f _i& r+ i+ a t b T + a t+ ib r -i+ - ha t+T bo , where = 0 if i > ra 

and ai — 0 if i > n. Now p divides a^, for i = 0,1,..., t — 1, p divides bj, 
for j — 0,1,...,r — 1, and p divides Q +r . Hence, p divides atb r , which is a 
contradiction. Thus, cont f(x)g(x) is a unit and so f{x)g{x) is a primitive 
polynomial. ■ 

Example 16.2.4 In Z[x], 6x 2 -rSx — 9 — 3(2x 2 + x — 3). Hence, 6x 2 + 3x — 9 
is not a. primitive polynomial. But 2x 2 + x — 3 is a primitive polynomial. 


Theorem 16.2.5 Let R be a UFD. Let /( x) and g{x) be two nonzero polyno¬ 
mials in i?[x]. Then there exists a unit u £ R such that 

cont(f(x)g(x)) = ucontf (x) contg(x). 
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Proof. Let c/ denote cont/(x) and c g denote contg(x). Then f(x) = Cff\(x) 
and g(x ) = c g g\{x ), where f\(x) and g\(x) are primitive polynomials in R[x}. 
Now cont(f(x)g(x)) and cont(cfC g fi(x)gi(x )) are associates. Since CfC g is a 
nonzero element of i?, it follows that 

cont(cfC g fi(x)gi(x)) 


and 

CfC g cont(fi(x)gi(x)) 

are associates. By Lemma 16.2.3, cont(fi(x)gi(x)) is a unit. Hence, 

cont(/(x)g(x)) = ucfC g 


for some unit u. ■ 

It is known that the polynomial ring F[x] over a field F is a Euclidean 
domain, and hence a unique factorization domain. To take advantage of this 
result, let us extend an integral domain R to its quotient field Q(R) and es¬ 
tablish the relationship between elements of Q(R)[x] and R[x). 

In the remainder of the section, we let Q(R) denote the quotient field of R. 

Lemma 16.2.6 Let R be a UFD. If f{x) is a nonzero polynomial in Q(R)[x). 
then there exist nonzero elements a,b E R and a primitive polynomial fi(x) in 
i?[a;] such that f(x) = a6 -1 /i(T)) where 6 -1 is the inverse of b in Q(i?)[^]. 


Proof. Let f{x) =■ cq + c\x-\ — ■ + c n x n E Q(R)[x] be a nonzero polynomial. 
Then Ci E Q(R), i = 0,1,...,n. Therefore, there exist a*, 6; £ R such that 
Ci = a t b~ l , bi 0, i = 0,1,..., n. Now f(x) = aob^ 1 + aib^x H — • + a n bL l x n . 
Let b — 6q6i • • • b n . Then 


bf(x) = aobi •••&„ + ai&o ^>2 * ■ ■ b n x + • • • + a n 6o^i • • ■ b n -\x n E R[x]. 

Clearly bf(x) is nonzero. Let a = cont (bf(x)). Then bf[x) = af\{x ), where 
cont/i(ic) is a unit and f\(x) E R[x\. Hence, f(x) = b~ 1 afi(x). where 6 ,a E R 
and fi(x) is a primitive polynomial in 7?[x]. ■ 


Lemma 16.2.7 Let R be a UFD. Let f(x) be a nonzero polynomial in R[x\. 
If f(x) =■ d\f\{x) = ^ 2 / 2 ( 2 :), where fi(x) and f 2 (x) are primitive polynomials 
in i7[x] and d\,d 2 E Q(R ), then d\ = ud 2 for som.e unit u E R. 
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Proof. Since di,d 2 £ Q(R), we can write d\ = a6 _1 and = ccCHor 
some a, 6, c, d £ R. Thus, /(x) = ab~ 1 fi(x ) = cd _ 1 / 2 (x). This implies that 
adfi(x) = 06 / 2 ( 2 ;). Since fi(x) and f 2 (x) are primitive, ad = ucb for some unit 
u £ R by Theorem 16.2.5. Thus, d\ = a 6 _1 = ucd~ l = ud 2 - ■ 

Lemma 16.2.8 Let R be a UFD. Let /(x) be a nonconstant primitive poly¬ 
nomial in R[x]. Then /(x) is irreducible in jR[x] if and only if f(x) is irreducible 
in <2(J7)[x]. 


Proof. Suppose /(x) is irreducible in R[x] and /(x) is not irreducible in 
Q(7?)[x]. Then there exist h{x),g(x) £ Q(i?)[x] such that /(x) = h{x)g{x), 
deg/i(x) > 1, and deg<?(x) > 1. By Lemma 16.2.6, there exist a,b,c,d £ R 
with 6 / 0, d / 0, and primitive polynomials hi(x), g\[x) £ i?[x] such that 
h(x) = ab~ 1 h\(x) and g{x) — cd~ l g\{x). Hence, /(x) — ab~ l cd~ l hi(x)g\(x). 
This implies that bdf{x) = ach\(x)gi(x). Now /(x) is primitive and so cont/(x) 
is a unit. Thus, cont(6d/(x)) — bdu for some unit u. Now 

cont(achi(x)< 7 i(x)) = vac cont(6,i(x)<?i (x)) for some unit v £ R 

= v\ac cont(hi(x)) cont(pi(x)) for some unit vi £ R 
— v\acv 2 Vi for some units 212,^3 £ R- 

Hence, bd — acw for some unit w £ R. Thus, /(x) = whi{x)g\{x) for some unit 
w £ R. This shows that fix) is not irreducible in f?[x], which is a contradiction. 
Therefore, /(x) is irreducible in Q(i?)[x]. Conversely, let /(x) be irreducible in 
Q(R)[x]. Suppose /(x) is reducible in R[x]. Now /(x) = rg(x), where r £ R 
and r is a not a unit is impossible since /(x) is primitive. Thus, there exist 
polynomials /i(x), f 2 {x) in R[x] such that deg /i(x) > 1, deg/ 2 (x) > 1, and 
/(x) = fi (x)/ 2 (x). Now fi(x) and f 2 (x) are also nonconstant polynomials in 
Q{R)[x]. Hence, /(x) is not irreducible in Q(R)[x], a contradiction. Conse¬ 
quently, /(x) is irreducible in R[x}. ■ 

Example 16.2.9 Consider the polynomial 4x + 4 in Q[x]. Now 4x + 4 = 4(x + 
1). 4 is a unit in Q[x] and x+l is irreducible in Q[x]. Hence, 4x+4 is irreducible 
in Q[x]. But 4 is not a unit in Z[x]. Hence, 4x + 4 is not irreducible in Z[x]. 
Also, 3 is irreducible in Z[x], but 3 is not irreducible in Q[x]. 

We are now in a position to prove our main result of this section. Before 
proving this theorem, let us recall the following assertions concerning the poly¬ 
nomial ring R[x] so that we can enjoy the beauty and depth of this theorem, 

(i) If R is a commutative ring with 1, then R[x] is a commutative ring with 


(ii) If R is an integral domain, then i?[x] is an integral domain. 

(iii) If R is a field, then R[x] is not a field, but R[x] is a Euclidean domain. 

(iv) If R is a PID, then R[x] may not be a PID. 
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Theorem 16.2.10 Let R be a UFD. Then R[x} is a UFD. 

Proof. Let f(x) be a polynomial of degree n > 1. Let f(x ) = Cffi(x), 
where Cf is a content of f(x) and fi(x) is a primitive polynomial in R[x]. 
Now Q(R)[x] is a UFD and f\(x) G R[x\ C Q(i?)[a:]. Therefore, there exist 
irreducible polynomials g\(x), g2{x)i - • ■ ? 9r( x ) in Q(A1)[£] such that fi(x) = 
gi(x)g2(x) •• -gr{x). By Lemma 16.2.7, gi(x) = a t b~ l hi(x), a l} b t G R, h ^ 0, 
and hi{x) is a primitive polynomial in i = l,2,...,r. Also, by Lemma 

16.2.8, hi(x ) is irreducible in R[x], i = 1,2,... ,r. Hence, 

fi(x) = aia ,2 • • • a r 6^ 1 62 1 • ■ • b~ l h\{x ) ■ • • h T (x). 

Let a — a\a 2 • ■ ■ a r and b = bib 2 • • • b T . Then 

bf\(x) = ah\{x) ■ - ■ h T (x). (16.1) 

By Lemma 16.2.3, h\(x) ■ * • h r (x) is primitive. This implies that a — ub for 
some unit u E R and so 


fi(x) = uhi(x) • • ■ h T {x). 


This shows that 

f(x) = ucfhi(x)-‘-h r (x). (16.2) 

Since an associate of an irreducible polynomial is also an irreducible polynomial, 
it follows that uhi{x) is irreducible. Thus, for any polynomial f(x) of degree 
>1, there exist irreducible polynomials gi(:r),... ,g k {x) in i?[x] such that 

f{x) = c f gi{x) •■■g k (x), 

where Cf — cont f(x). If c/ is not a unit, then there exist irreducible elements 
ai , <22 ,.. ■, at £ R such that 

f(x) = aia 2 - • ■ a t gi(x) • ■ ■ g k (x). (16.3) 

Suppose now that 

f(x) = aia 2 ■ • ■ a t gi{x) ■ ■ ■ g k {x) = & 1&2 • • ■ khi(x) • • • h q {x), ( 16 . 4 ) 

where a {, bj are irreducible elements in R, i = 1,..., t, j — l.... ,1 and 

gi(x),...,g k {x), h l (x),... ) h q {x) 

are irreducible elements in i?[ar]. Now aia 2 ■ ■ - a t and b\b 2 • • -bi are two factor¬ 
izations as a product of irreducible elements in R of Cf. Therefore, by (16.4) 

9i( x ) ■ --gkix) = dhi(x) • • • hJx ), (16.5) 
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where d is a unit in R. Now (71 (x),... ,gk{x), /ii(x),..., h q (x) are primitive and 
irreducible in R[x]. Hence, these polynomials are also irreducible in Q(R)[x\. 
Since Q(R)[x] is a UFD, Eq. (16.5) implies that k — q and there exists a 
one-one correspondence between {<?i(x),... ,g^{x)} and {/ii(x),... ,h q (x)} such 
that the corresponding factors are associates in Q(R)[x] and hence by Lemma 
16.2.7, they are also associates in 12[x]. Thus, the factorization (16.4) of /(x) 
in R[x\ is unique. Consequently, i?[x] is a UFD. ■ 

Corollary 16.2.11 Let R be a UFD. The polynomial ring R[x\ ,... , x n ] is a 

ufd. m 1 

We see that the polynomial ring F[x,y] is a unique factorization domain. 
However, F[x,y] is not a Euclidean domain. This can be verified by showing 
that F[x,y] is not a principal ideal ring. We ask the reader to show in the 
exercises that the ideal (x,y) in F[x,y] is not a principal ideal. 

As shown in Example 16.1.11, Z[i\/5] is not a UFD. Thus, even though 
the polynomial ring T[x] is a unique factorization domain, a ring of the form 
F[c] need not be one. Thus, the homomorphic image of a unique factorization 
domain need not be a unique factorization domain. 


16.2.1 Worked-Out Exercises 

0 Exercise 1 Let /(x) be a nonzero polynomial in Z[x\. Show that f(x) can 
be expressed a s a product of two polynomials g{x) and h(x) of Q[x] with 
degg(x) < deg/(x) and deg h{x) < deg/(x) if and only if there exist 
gi(x ), h\{x) £ Z[x] such that degg(x) = deggi(x), deg/i(x) = deg h\ (x), 
and f(x) = gi(x)hi(x). 

Solution: Suppose there exist g(x) and h(x) in Qjx] with degg(x) < 
deg/(x), degh(x) < deg/(x), and /(x) = g{x)h{x). There exist nonzero 
elements a,b,c,d £ Z and primitive polynomials g 2 ^x ), h 2 (x) £ Z[x] such 
that g(x) — < 2 & -1 <? 2 (x) and h{x) = cd~ l h, 2 {x) by Lemma 16.2.6. Hence, 
f(x) = ab~ l cd~ l g 2 (x)h 2 {x). This implies that bdf(x ) = acp 2 (x)/?, 2 (x). Let d\ be 
the content of f(x). Then we can write /(x) = c?i/i(x), where fi(x) is a prim¬ 
itive polynomial in Z[x]. Hence, bdd\f\(x) — acg 2 {x)h 2 {x ). Now < 72 (^)^2^) is 
also a primitive polynomial. Then bdd\ — uac for some unit u £ Z. This implies 
bddi = ac or bddi — —ac. Hence, fi(x) — g 2 {x)h 2 {x) or f\{x) — —g 2 {x)h 2 {x). 
Let gi(x) = dig 2 (x). Now f(x) = di/i(x) = dig 2 (x)h 2 {x) = 9 i(x)hi(x), where 
hi(x) = h 2 (x) or f(x) = difi(x) = -dig 2 (x)h 2 {x) = pi(x)hi(x), where 
h\(x) = —/i 2 (x). Also, from the construction, it follows that degg 2 (x) = 
deggi(x) = degg(x) < deg/(x) and degh 2 (x) = deg h\ (x) = degh(x) < 
deg/(x). The converse is trivial. 


Exercise 2 Show that Z[x] is a UFD, but not a PID. 
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Solution: Since Z is a UFD, Z[x] is a UFD by Theorem 16.2.10. (By 
Corollary 15.1.11, Z[x] is not a PID. However, here we want to show that Z[x] 
is not a PID by showing the existence of ideals in Z[x], which are not principal.) 
Consider 

I = (x) + (n) , 

where n E Z, n {0,1, —1}. We claim that I is not a principal ideal. Suppose 
I — (f(x)) , where f{x) E Z[x\. Then (n) C (f(x)). Therefore, n = f(x)g(x) 
for some g(x) E Z. Since degn = 0, deg /(x) = 0 and hence /(x) E Z. Let 
f(x) = a € Z. Now (x) C (a) . Then x = ah(x ) for some h(x) E Z[x]. Again 
by a degree argument, deg/i(x) = 1. Let h{x ) — ao + a\x, where ao,ai E Z, 
ai 7 ^ 0. Then x = a(ao + a\x). Hence, 1 = aoi E (a) = / = (x) + (n) . Thus, 
1 = xs(x) -f- n£(x) for some s(x),£(x) E Z[x]. Let £(x) — t o + Ux-f • • • +t r x T . 
Then by comparing coefficients in 1 = xs(x) +nt(x), we get 1 = uIq. Hence, n 
divides 1, which is a contradiction. Therefore, I is not a principal ideal. 

16.2.2 Exercises 

1. Let fix) E Z[x\ be irreducible. Prove that /(x) is primitive. 

2. Let /(x) be a nonconstant primitive polynomial in Z[x]. Prove that if 
/(x) is not irreducible in Q[x], then /(x) is not irreducible in Z[x]. 

3. Show that the polynomial ring Q[x,y] is a UFD, but not a PID. 

4. Let R be a UFD. Let /(x) be a primitive polynomial in R[x\. Show that 
any nonconstant divisor of /(x) is also a primitive polynomial. 


16.3 Irreductibility of Polynomials 

In the previous section, we proved that any polynomial of degree > 1 over a 
UFD can be expressed as a product of irreducible polynomials. Thus, irre¬ 
ducible polynomials play an important role in polynomial rings. But it is not 
always easy to determine if a polynomial is irreducible over a UFD. In this 
section, we establish some criteria for irreducibility of polynomials. We first 
note that any polynomial of degree 1 over a field F is always irreducible. If 
/(x) = ax 4- b E F[x] with a ^ 0, then x = —a~ l b is a root of /(x) in F. In 
this connection, let us point out that a linear polynomial over a UFD D may 
not be irreducible in D[x\. For example 2x + 4 = 2(x + 2) is not irreducible in 
Z[x]. We now consider polynomials of degree 2 and 3. For these polynomials, 
we can apply the following test to check irreducibility. Let F denote a field. 


Theorem 16.3.1 Let /(x) E F[x] be a polynomial of degree 2 or 3. Then /(x) 
is irreducible over F if and only if /(x) has no roots in F. 
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Proof. Suppose that deg f(x) = 3 and f(x) is irreducible. If f(x) has a 
root in F , say a, then x — a divides f(x) in F[x] and so f(x ) is reducible over 
F. Conversely, suppose f(x) has no roots in F. Assume that f{x ) is reducible. 
Then f{x ) = g{x)h(x) for some g{x ), h{x) G F[x], degp(x) > 1 and deg h(x) > 
1 . Now deg(^(x)/i(a:)) = 3. Therefore, either degg(a;) = 1 and deg/i(x) = 
2 or degh(;r) = 1 and degg(x) = 2. To be specific, let deg<?(;r) = 1 and 
deg h{x) = 2. Then g(x) = ax + b for some a, b G F, a ^ 0. Now — a~ l b G F 
and g(—a~ l b) = 0. Thus, — a~ l b is a root of g{x) and hence — a~ l b is a root of 
f(x) in F. This is a contradiction to our assumption that f(x) has no roots in 
F. Hence, f(x) is irreducible over F. A similar argument can be used for the 
case when deg f(x) = 2. ■ 

Example 16.3.2 (i) Let f(x) = x 2 + x + [1] G Z 2 [a^] - Now 

/([ 0 ]) = [ 0] 2 + [ 0 ] + [ 1 ] ^ [ 0 ], 

/([ 1 ]) = [ 1] 2 + [ 1 ] + [ 1 ] = [ 1 ]^[ 0 ], 

Hence, f(x) has no roots in Z 2 . Thus, by Theorem 16.3.1, f{x) is irreducible 
over 7 j2- 

(ii) Let g{x) = x 3 -(- [2}x + [1] G Z 3 [x]. Now 

s ([ 0 ]) = [ 0] 3 + [ 2 ][ 0 ] + [ 1 ]^[ 0 ], 

s-([l]) = [l ] 3 + [2][1] + [1] = [4] = [1] ^ [0], 

and 

ff([2]) = [2 ] 3 + [ 2 ][ 2 ] + [ 1 ] = [13] = [ 1 ] ^ [ 0 ], 

Hence, g(x) has no roots in Z 3 . Thus, by Theorem. 16.3.1, g(x) is irreducible 
over Z 3 . 


Instead of considering polynomials over an arbitrary field, let us now con¬ 
sider polynomials over the field Q of all rational numbers. By Lemma 16.2.8, 
a nonconstant primitive polynomial f(x) G Z[x\ is irreducible in Q[x] if and 
only if f(x) is irreducible in Z[x]. It is not difficult to decide whether or not a 
polynomial is primitive. In order to decide whether or not f(x) is irreducible, 
we sometimes consider the corresponding polynomial in Z p [x] for some prime 

P- 


Theorem 16.3.3 Let f(x) = ao + a\x +-h a n x n G Z[x] be of degree n > 1. 

If there exists a prim.e p such that f(x) — [ao] + [ai]x + ■ • • + [a n ]cc n is irreducible 
in Z p [x] and deg f(x) = deg f(x), then f(x) is irreducible in Q[z]. 

~Pu*uc- 
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Proof. Suppose /(x) satisfies the given conditions of the theorem for some 
prime p. Suppose /(x) is reducible in Q[x]. Then there exist polynomials g{x) = 

bo + b\x + • —b b rn x rn and h{x) = cq + c\x H-b CfcX fc in Z[x\ , 0 < m < n, 0 < 

k < n such that f(x) = g{x)h{x) by Worked-Out Exercise 1 (page 379). Thus, 

[a 0 ] + [ai]xj-b[a n ]^ n = ([bo] + [&i]aH-f [ 6 m ]x m ) ([co]+ [ci]xH-b[c*]x fc ). 

Since deg/(x) = deg/(x) = n = k m, it follows that [ 6 m ][c^] y^ 0 in Z p . 
Hence, [b m ] y^ [0] and [cjt] y^ [0]. Consequently, g(x) and h{x) are nonconstant 
polynomials in Z p [x\. Since the units of Z p [x] are the nonzero elements of Z p , 
it follows that g{x) and h{x) are nonunits. Therefore, f(x) is not irreducible 
in Z p [x], a contradiction. Hence, f(x) is irreducible in Q[x]. ■ 

Example 16.3.4 Consider the polynomial /(x) = |x 3 — \x + 1 in Q[x]. Then 
14/(x) = 10x 3 — 7x + 14. Let /i(x) = 10x 3 — 7x + 14. Now in Zs[x], /i(x) = 
[10]x 3 - [7}x + [14] = I 3 - x + [2pSince 7T([0]) = [2], 7T([1]) = [2], 7T([2]) = 
[2] 3 — [2] -f- [ 2 ] = [2], it follows that /i(x) has no root in Z 3 [x]. As a result 14/(x) 
is irreducible in Q[x], But 14 is a unit in Q[x]. Hence, /(x) is irreducible in 
Q[x]. 

Let /(x) E Q[x] and deg/(x) > 2. If /(x) has a root in Q, then /(x) is 
reducible. The following theorem will help us to see whether a polynomial 
fix) E Q[x] has a root in Q. 

Theorem 16.3.5 Let /(x) = ao + cl\x + • • • + a n x n E Z[x] be of degree n and 
ao i=- 0. Let ^ € Q be a root of /(x), where u and v are relatively prim,e. Then 

u\ao and, v\a n . 


Proof. Since ^ is a root of /(x), 

. *U /. u ,u s „ 

0 = /(—) = ao + ai—b ■ • • + a , n {—) . 
v v v 

Thus, 


0 = aov n + a\uv n 1 -b • • • + a n _iu n 1 v + a n u n . 

Hence, 

r;(aou n-1 + aiuv n ~ 2 +-b n_1 ) = —a n u n . 

This implies that v\a n u n . Since u and v are relatively prime, v\a n . 
u |oq. ■ 


Similarly, 


Example 16.3.6 Let /(x) = 2x 3 — 7x + 1 and ^ E Q be a root of /(x) with 
gcd(u, v) = 1. Then u\l and v\2. Hence, u = ±1 and v = ±1, ±2. This implies 


that g = ±l,±y. Now /(1) y^ 0, /(-1) ± 0, f(%) = \ - \ + 1 y^ 0, and 
f(—b) = — \ + \ + 1 y^ 0. So we find that /(x) has no root in Q. Thus, by 


Theorem. 16.3.1, /(x) is irreducible in Q[x]. Since /(x) is primitive, f{x) is also 
irreducible in Z[x\. 
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Let us now give another criterion for irreducibility. This famous criterion 
is known as Eisenstein’s irreducibility criterion. 

Theorem 16.3.7 (Eisenstein’s Irreducibility Criterion) Let D be a UFD 

and Q(D) be its quotient field. Let 

f(x) = clq + a\x + • ■ • + a n x n 

be a nonconstant polynomial in D[x\. Suppose that D contains a prim.e p such 
that 

(i) p\m , i — 0,1,... ,n - 1, 

(ii) p / a n , and 
(in) p 2 f a 0 . 

Then f{x) is irreducible in Q(D)[x\. 

Proof. Case 1. fix) is a primitive polynomial in D[x\. Under this as¬ 
sumption, if we can show that f{x) is irreducible in D[x\, then by Lemma 
16.2.8, it will follow that f(x) is irreducible in Q(D)[x\. Suppose that f{x) is 
not irreducible in D[x\. Then there exist polynomials 

g(x) = bo + b\x 4--h b t x t 

h{x) = Co + c\x 4- ■ • • + CkX k 

in D[x] such that f{x) = g(x)h{x) and g{x) and h(x) are nonunits in D[x\. Now 
n = t+k. If t = 0, then g[x) = 6o, a nonunit element of D. Thus, f(x) = boh(x) 
implies that f(x) is not primitive. Therefore, t ^ 0. Similarly, k 0. Hence, 
0 < t < n and 0 < k < n. Now from f{x) = g[x)h{x), we find that ao — i>o c O- 
Since p is a prime such that p|ao and p 2 f ao> it follows that p divides one of 
bo , Co, but not both. Suppose p\bo and p )[ co- Since p / a n and a n = btCk , p 
l bt and p / Ck . Thus, p\bo and p \ bt. Let m be the smallest positive integer 
such that p / b m . Then p\bi for 0 < i < m < t. Now considering the coefficient 
of x m in f{x) and g(x)h(x), it follows that 

= boCjn + bic m —i + • • • T b rn —\C\ + 6 m co. 

Since p\bi, 0 < i < m, we find that p\{a m — bmCo). Since m < t < n, p\a m . 
Hence, p|6 m co and so p\b m or p|co since p is prime. This is a contradiction. 
Therefore, f{x) is irreducible in D[x] and hence in Q[D)[x\. 

Case 2. f(x) is not a primitive polynomial in D[x\. Let d — gcd{ao, 
ai,..., a n } in D. Then fix) = dfi{x), where f\{x) is a primitive polynomial in 
D[x\. Let fi(x) = do + d\x + • ■ • + d n x n . Then a * = for alH = 1, 2,..., n. 
Since p does not divide a n , p does not divide d. Therefore, it now follows that 
p\d{, i — 0, 1 ,..., n — 1 , p fd n and p 2 /do- Thus, by Case 1, f\{x) is irreducible 
in Q(D)[x]. Now d is a unit in Q(D). Hence, f{x) is irreducible in Q(_D)[aT ■ 
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Corollary 16.3.8 Let D be a UFD and f(x) = ao + a\x + • • • + a n x n be a 
nonconstant primitive polynomial in D[x\. Suppose that D contains a prim,e p 
such that 

(i) p\ai, i = 0,1,... ,n - 1, 

(ii) p / a n , a.nd, 

(in) p 2 / a 0 . 

Then f(x) is irreducible in D[x\. ■ 

Corollary 16.3.9 Let f{x) — ao+a\x+■ ■ --HOnX 71 be a nonconstant polynomial 
in Z[x\. If there exists a, prime p such that 

(i) p\ai , i = 0,1,... ,n - 1, 

(ii) p / a n , and 
(m) p 2 / a 0 , 

then fix) is irreducible in Q[x]. ■ 

Corollary 16.3.10 The cyclotomic polynomial 

i x p - 1 

(f)p(x) — 1 + X + • • • + x p — -— 

X' 1 

is irreducible in Zi[x] y where p is a prim,e. 


Proof. Since the content of (j> p (x) is 1, we find that <f> p (x) is a primitive 
polynomial. Suppose 4> p {x) is not irreducible in Z[x\. Then there exist non¬ 
trivial factors h(x) and g{x) of <f p (x) such that <f p {x) = h{x)gix). This implies 
that (f) p (x -1- 1) = h[x + 1 )g{x + 1) is a nontrivial factorization of (f) p (x + 1). 
However, 


(aH-l)P-l 
(x+l)-l 

x p +px p ~ 1 -\ -- \-px 

p + • • • + (X) x 2 ~^ + • • • + px p ~ 2 + 
is clearly irreducible by Eisenstein’s criterion. Hence, 4> p (x) is irreducible in 

z[x\. m 

Gauss is said to have placed Eisenstein at the same mathematical level as 
Newton and Archimedes. However, Eisenstein’s influence on mathematics is 
considered to be small in comparison to that of the giants of mathematics. 

16.3.1 Worked-Out Exercises 

0 Exercise 1 Show that f(x) = x 3 + [2]x + [4] is irreducible in Zs[x]. 

T^nfuc. 


4> p {x + 1) = 
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Solution: /([0]) - [4], /([lj) = [7] = [2], /([2]) - [3] + [4] + [4] = [1], 
/([3]) - [2] + [1] + [4] = [2], /([4]) = [4] + [3] + [4] - [lj. Hence, f(x) has no 
roots in Z 5 . Thus, by Theorem 16.3.1, /(x) is irreducible in Z 5 [cc]. 

<0 Exercise 2 Let f(x) = x 6 + x 3 + 1 6 Z[x], Show that /(x) is irreducible 
over Q. 

Solution: Now /(x + 1) = x 6 + 6 x 5 + 15x 4 + 21x 3 + 18a: 2 + 9a: + 3. Let 
p — 3. Then by Eisenstein’s criterion, /(x + 1) is irreducible over Q. Hence, 
f{x) is irreducible over Q. 

0 Exercise 3 Show that f{x) — x 4 — 5a : 2 + x + 1 is irreducible in Z[xj. 

Solution: Let us first show that f{x) is irreducible in Q[xj. If /(x) has a 
linear factor, then /(x) has a root in Q. Let | (a, b are relatively prime) be a 
root of /(x) in Q. Then 6|1 and a|l by Theorem 16.3.5. Hence, | = 1 or — 1 . But 
/( 1 ) = 1 - 5 + 1 + 1 = -2 ± 0 and /(-l) = 1 - 5 - 1 + 1 = —4 ^ 0. Therefore, 
/(x) has no linear factors in Q[xj. Let f(x) — (x 2 + ax + b) (x 2 + cx + d) in 
Z[xj. Equating coefficients of powers of x, we find that 

c + a = 0, d + 5 + ac— —5, ad + be = 1, bd = 1. 

Now bd — 1 implies that either b = d= l or b = d = — 1 . Suppose b = d = 1 . 
Then a + c = 1. But we also have a + c — 0, a contradiction. Suppose b — d — 
—1. Then ad-\- be = 1 implies that a + c — —1. Thus, a + c = —1 and a + c — 0, 
a contradiction. Hence, we find that there are no integers a, 6 , c, d such that 
f{x) — (x 2 + ax + b){x 2 + cx + d). This also implies that f{x) cannot be factored 
as a product of two quadratic polynomials in Q[x] (see Worked-Out Exercise 
1 , page 379). Thus, f{x) is irreducible in Q[xj. Hence, by Lemma 16.2.8, f(x) 
is irreducible in Z[x]. 

0 Exercise 4 Show that f{x) = x 5 4- 15a: 3 + lOx + 5 is irreducible in Z[x\. 

Solution: The content of f{x) is 1 . Therefore, /(x) is a primitive poly¬ 
nomial. Now 5 is a prime integer and 5|5, 5[ 10, 510, 5115, 5 / 1, 5 2 /5. Hence, 
by Corollary 16.3.8, f{x) is irreducible in Z[xj. 

<0 Exercise 5 Give an example of a primitive polynomial which has no root 
in Q, but is reducible over Z. 

Solution: Let f{x) = x 4 + 2 x 2 + 1 . This is a primitive polynomial in Z[xj. 
If possible, let | be a root of /(x), where a ^ 0, b / 0 and gcd(a, b) = 1. Then 
a\\ and 6|1 by Theorem 16.3.5. Hence, | = ±1. But /(1) ^ 0 and /(—1) 0. 

Therefore, /(x) has no root in Q. Since /(x) = (x 2 + l)(x 2 + l), /(x) is reducible 
in Z[xj. 
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Exercise 6 Show that x 2 + x + [1] is the only irreducible polynomial of degree 
2 over Z 2 . 

Solution: Any polynomial of degree 2 over Z 2 is of the form ax 2 + bx + c, 
where a, b,c £ Z 2 = {[0], [1]}. Now a / [0]. Therefore, a = [1]. Then x 2 , x 2 + x, 
x 2 + [1], and x 2 + x + [1] are the only polynomials of degree 2 over Z 2 - Now 
x 2 — xx, x 2 + x = x(x + [ 1 ]), and x 2 + [ 1 ] = (x + [l])(x + [ 1 ]) showing that x 2 , 
x 2 + x, and x 2 + [1] are reducible. Let /(x) = x 2 + x + [1]. Then /([0]) = [1] ^ 0 
and /([ 1]) — [3] — [1] 7 ^ 0. Therefore, fix) has no root in Z 2 . Thus, x 2 + x + [1] 
is irreducible over Z 2 . 


16.3.2 Exercises 

1. Find all irreducible polynomials of degree < 2 in Z 2 [cc]. Is x 3 + [ 1 ] ir- 
reducible in Z 2 [x]? If not, then express it as a product of irreducible 
polynomials in Z 2 • 

2. Show that the polynomial x 5 + x 2 + [ 1 ] is irreducible in ^[x]. Hence, 
prove that x 5 — x 2 + 9 is irreducible in Z[x\. 

3. Show that the polynomial x 2 + [2]x + [ 6 ] is reducible in Z 2 [x] even though 
x 2 + 2x + 6 is irreducible in Z[x]. 

4. Use Eisenstein’s criterion to prove that the polynomials x 2 + 2x + 6 and 
2 x 4 + 6 x 3 — 9x 2 + 15 are irreducible over Z. 

5. For f(x ) E D[x], D a UFD, prove that f(x) is irreducible in D[x] if and 
only if fix — c ) is irreducible in D[x] for any c 6 D. 

6 . Show that the polynomials x 3 — x 2 + 1 , x 3 — x + 1 , and x 3 + 2x 2 + 3 are 
irreducible in Z[x]. 

7. Show that the polynomial 2x 3 — x 2 + 4x — 2 is not irreducible in Z[x]. 

8 . Show that the polynomial x 2 + ^x — | is irreducible in Q[x]. 

9. Prove that the polynomial /(x) = 1 — x + x 2 — x 3 H-f (— l) p- 1 x p_1 is 

irreducible in Z[x] for any prime p. 

10. Let D be a UFD and /(x) = ao + a\x + • • • + a n x n E D[x] be of degree 
n and ao 7 ^ 0. Let uv~ l E QiD) be a root of fix), where u,v E D and 
gcd(n,u) = 1. Prove that u|ao and v\a n in D. 

11 . Show that for any positive integer n > 1, /(x) = x n + 2 is irreducible in 

zM- 

12. Find all irreducible polynomials of degree 2 over the field Z 3 . 

~Pii*uc- 7^o^Lc#fta£uiaZ 
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13. If f(x) is an irreducible polynomial over R, prove that either f(x) is 
linear or f(x) is quadratic. 

14. Show that there are only three irreducible monic quadratic polynomials 
over Z 3 . 

15. (i) Show that there are only 10 irreducible monic quadratic polynomials 
over Z 5 . 

(ii) Let p be a prime. Find the number of irreducible monic quadratic 
polynomials over Z p . 
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Leopold Kronecker (1823-1891) was 
born on December 7, 1823, in Liegnitz, Ger¬ 
many, to a wealthy family. He was pro¬ 
vided with private tutoring at home. He 
later entered Liegnitz Gymnasium, where E. 
E. Kummer was his mathematics teacher. 
Rummer recognized his talent and encour¬ 
aged him to do independent research. 

In 1841, he matriculated at the Univer¬ 
sity of Berlin. There he attended Dirichlet’s 
and Steiner’s mathematics lectures. He was 
also attracted to astronomy and in 1843 at¬ 
tended the University of Bonn. He returned 
to Berlin in 1845, the year he received his 

Ph.D. His thesis was on complex units. 

On Rummer’s nomination, Kronecker became a full member of the Berlin Academy 
in 1861. He was very influential at the Academy and personally helped fifteen mathe¬ 
maticians, including Riemann, Sylvester, Dedekind, Hermite, and Fuchs, to get various 
memberships. 

Kronecker’s primary work is in algebraic number theory. He is believed to be one 
of the inventors of algebraic number theory along with Kummer and Dedekind. He 
was the first mathematician who clearly understood Galois’s work. He also proved the 
fundamental theorem of finite Abelian groups. 

Briefly Kronecker withdrew from academic life to manage the family business. 
However, he continued to do mathematics as a recreation. In 1855, he returned to the 
academic life in Berlin. In 1880, he became editor of the Journal fur die reine and 
angewandte Mathematik. 

Kronecker and Weierstrass were good friends. While Weierstrass and Cantor were 
creating modern analysis, Kronecker’s remark that “God himself made the whole 
numbers—everything else is the work of men” deeply affected Cantor, who was very 
sensitive. His remarks in opposition to Cantor’s work are believed to be a factor in 
Cantor’s nervous breakdown. 

Kronecker died on December 29, 1891. 






Chapter 17 

Maximal, Prime, and Primary 
Ideals 

17.1 Maximal, Prime, and Primary Ideals 

In this section, we introduce certain special ideals. These ideals are motivated 
in large part by certain arithmetic properties of the integers. Throughout the 
section, we assume that the ring R contains at least two elements. 

Definition 17.1.1 An ideal P of a ring R is called, prime if for a.ny two ideals 
A and B of R , AB C P implies that either A C P or B C P. 

The following theorem gives a useful characterization of a prime ideal with 
the help of elements of R. Let us first recall that if A is a left ideal and B is 
a right ideal of a ring R , then AB is an ideal of R. Let a € R. Then Ra is a 
left ideal of R and aR is a right ideal of R. Thus, R(aR) is an ideal of R. We 
denote R(aR) by RaR. Also, for a € R, aRa — {ara \ r € R}. 

Theorem 17.1.2 An ideal P of a ring R is a prim,e ideal if and only if for all 
a,b £ R, aRb C P implies that either a £ P or b £ P. 

Proof. Suppose P is a prime ideal and aRb C P, where a, 6 £ R. Let A = 
RaR and B = RbR. Then A and B are ideals of R. Also, AB = ( RaR)(RbR ) C 
R(aRb)R C RPR C P. Since P is a prime ideal, it follows that either A C P or 
PCP. Suppose A C P. Now (a ) 3 C RaR — A C P. Since P is a prime ideal, 
(a) C P and so a £ P. Similarly, if B C P, then b £ P. Thus, either a £ P 
or b £ P. Conversely, suppose that the ideal P satisfies the given condition of 
the theorem. Let A and B be two ideals of R such that AB C P. Suppose 
that A P. Then there exists a £ A such that a ^ P. Let b £ B. Now 
aRb = ( aR)b C AB C P. This implies that a E P or b G P. But a P. 
Therefore, b E P. Hence, BCP.I 


“Public. TfeoiAewtaitca/ 
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Corollary 17.1.3 Let R be a commutative ring. An ideal P of R is a prim,e 
ideal if and only if for all a,b E R, ab E P implies that either a E P orb GP. I 

Example 17.1.4 In the ring Z of integers, the ideal P — {3/c | k E Z} is a 

prim.e ideal. For, ab E P if and only if ab is divisible by 3 if and, only if a 

is divisible by 3 or b is divisible by 3 (since 3 is prim,e) if and only if a is a 

multiple of 3 or b is a multiple of 3 if and, only if a € P or b E P. In Z, the 
ideal J = {6 k | k E Z} is not a prim,e ideal since 3-2 = 6 E J, but 3 ^ J and 
2 (£ J. 

Theorem 17.1.5 Let R be a PID and P be a, nonzero ideal of R. Then P is 
prim,e and P / R if and only if P is generated, by a prim.e elem.ent. 

Proof. Let R be a PID and P = (p) be a nonzero proper prime ideal of R. 
Then p / 0. Since P / R, p is not a unit. Let a, b E R be such that p\ab. Then 
ab — pc for some c E R. Hence, ab E P. Since P is a prime ideal, either a E P 
or b E P. Therefore, either p\a or p\b. Thus, p is a prime element. Conversely, 
suppose that P — (p) is a nonzero ideal of R such that p is a prime element. 
Since p is not a unit, P ^ R. Let a, b be two elements of R such that ab E P. 
Then p\ab. Since p is a prime element, either p\a or p\b. Therefore, either a E P 
or b E P. Hence, P is a prime ideal of R. M 

As a consequence of Theorem 17.1.5 and Theorem 15.1.9, the prime ideals 
of Z are precisely those ideals generated by primes and the ideals {0} and Z. 
Also, by Theorem 15.3.16, the prime ideals in the polynomial ring ,F[x] over 
a field F are those ideals generated by irreducible polynomials and the ideals 
{ 0 } and jF[x]. 

Definition 17.1.6 Let R be a ring and M be an ideal of R. Then M is called 
a maximal ideal of R if M ^ R and there does not exist any ideal I of R 
such that M C / C R. 

Let T(R) be the collection of all proper ideals of R. Since {0} E T(R) : 
T(R) ^ <f>. Now (T(R),<) is a lattice, where < is the set inclusion relation. 
Clearly a maximal element (if one exists) of this lattice is a maximal ideal of 
the ring R. 

Theorem 17.1.7 Let R be a commutative ring with 1. Then every maximal 
ideal of R is a prime ideal of R. 


Proof. Let I be a maximal ideal of R and a and b be two elements of R 
such that ab E I and a £ I. Now (I, a) = {u + ra \ u E /, r E R} is the ideal 
generated by I U {a}. Since a £ /, I C (/, a) . Also, since I is a maximal ideal. 
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(I, a). = R. Thus, there exist u £ / and r £ R such that 1 = u 4- ra. This 
implies that 6 — ub + rab £ I. Hence, / is a prime ideal. ■ 

The converse of the above theorem is not true, as shown by the following 
examples. 

Example 17.1.8 In the ring Z of integers, {0} is a prim,e ideal, but not a 
m,axim,al ideal. 

Example 17.1.9 Let R = {( a,b ) | a,b £ Z}. Then (#,+,•) is a ring, where 
+ a.nd • are defined, by 


(a, b) T (c, d) = ( a + c,bPd ), 

(a, 6 ) ■ (c, d) = ( ac , bd) 

for all a,b,c,d £ Z. Let I — {(a,0) | a £ Z }. Then I is a prime ideal of R, but 
not a m,axim,al ideal since I C (I, (0,2)) C R. 

Theorem 17.1.10 Let R be a principal ideal dom,ain. Then a nonzero ideal 
P ^ R) of R is prim.e if and only if it is maxim,al. 

Proof. Suppose P R) is a nonzero prime ideal. By Theorem 17.1.5, 
P = (p) for some prime element p £ R. We now show that there is no ideal 
I of R such that P C I C R. Suppose I is an ideal of R such that P C /. 
Since P ^ I, there exists an element a £ I such that a ^ P. Then a and p are 
relatively prime and so there exist s, t £ R such that 1 = sa + tp. Since sa £ I 
and tp £ P C I, we must have 1 £ I. This implies that I — R. Hence, P is 
maximal. ■ 

We now give characterizations of prime ideals and maximal ideals in a 
commutative ring with identity by the quotient rings of the ideals. 

Theorem 17.1.11 Let R be a com,mutative ring with 1 a,nd P be a,n ideal of 
R such that P 7 ^ R. Then P is a prim,e ideal if and only if R/P is an integral 
dom,a,in. 


Proof. Let P be a prime ideal of R. Since R is a commutative ring with 
1, the quotient ring R/P is also a commutative ring with 1. Now P 7 ^ R and 
so the identity element 1 -f P of R/P is different from the zero element 0 + P. 
Let us now show that R/P has no zero divisors. Let a 4 - P, b + P £ R/P, and 
(a + P)(b + P) = 0 T P. Then ab + P = 0 + P, which implies that ab £ P. 
Since P is a prime ideal, either a £ P or b £ P, i.e., either a + P = 0 + For 
b + P = 0 + P. Thus, R/P has no zero divisors. This implies that R/P is an 
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integral domain. Conversely, suppose R/P is an integral domain. Let ab G P. 
Then 0 + P — ab + P = (a + P) (b + P) , whence aTP = 0 + P or b-\-P = 0 + P. 
Thus, a G P or b G P and so P is a prime ideal. ■ 

Theorem 17.1.12 Let R be a commutative ring with 1 and M be an ideal of 
R. Then M is a m.axim,al ideal if and only if R/M is a field. 


Proof. Suppose that M is a maximal ideal. Since R is a commutative ring 
with 1 , R/M is a commutative ring with 1. For all a E R , let a denote the coset 
a + M in R/M. Let a G R/M be such that a ^ 0. Then a £ M. Hence, the 
ideal (M, a) generated by M U {a} properly contains M. Since M is a maximal 
ideal, we have (M, a) = R. This implies that there exist m G M and r G R 
such that 777 , + ra = 1. Thus, m + ra — 1 and so ra = 1. Hence, a has an 
inverse. This shows that every nonzero element of R/M is a unit and so R/M 
is a field. Conversely, suppose R/M is a field. Since R/M is a field, R ^ M. 
Let I be an ideal of R such that M C I C R. There exists a G I such that 
a /l M. Then a ^ 0 and so there exists r G R/M such that or = 1. Thus, 
(a + M)(r + M) = 1 + M, which implies 1 - ar G M, Hence, 1 = m + ar 
for some m G M. Thus, 1 = m + ar G M + I C I. This implies that I — R. 
Therefore, M is maximal. ■ 

As a consequence of Theorems 15.1.9 and 17.1.10, the maximal ideals of 
Z are precisely those ideals generated by primes. Also, by Theorem 15.3.16, 
the maximal ideals in the polynomial ring F[x] over a field F are those ideals 
generated by irreducible polynomials. 


Example 17.1.13 Consider the polynomial ring R[x,y\ over an integral do¬ 
main R. Then R[x,y\/ { x) ~ R[y] and R[x,y]/ (y) ~ i?[x], which are integral 
domains. Thus, { x) and (y) are prime ideals. Since R[x,y\/ (x) and R[x,y]/ (y) 
are not fields, (x) and, (y) are not maximal ideals. 


Example 17.1.14 Consider E, the ring of even integers. The ideal (4) is 
maximal, but not prim.e in E since 2-2 G (4), but 2 ^ (4) . Note that E is 
commutative without identity. 

We now show the existence of maximal ideals in certain rings. In order to 
accomplish this, we require Zorn’s lemma. 


Theorem 17.1.15 Let R be a. commutaiive ring with 1. Then every proper 
ideal of R is contained in a maximal ideal of R. 
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Proof. Let / be a proper ideal of R and set A — {J \ I C J, J is a proper 
ideal of R}. Since I E A, A ^ <fi. Also, A is a partially ordered set, where the 
partial order < is the usual set inclusion. We now show that any chain in A 
has an upper bound in A. Let C = {J a \ a E K} be a chain in A. Since I C J Q 
for all a, I C U a J Q . Let a, b E U a J a . Then a E Ja and b E Jp for some a , (5. 
Since C is a chain, either J a Q Jp or Jp C J Q , say, J a C Jp. Thus, a, b E Jp. 
Since Jp is an ideal of R, a — b E Jp C U Q J a - Let r E R. Then ra E J a C U a J a , 
whence U a J a is an ideal of R. Now U a J Q ^ R else 1 E J a for some a, which is 
impossible since J a ^ R. Hence, U Q J a G A, which is clearly an upper bound of 
C and so by Zorn’s lemma, A has a maximal element, say, M. We now show that 
M is a maximal ideal. If there exists an ideal J of R such that M C J C R, 
then J E A and so M is not maximal in A , a contradiction. Thus, no such J 
exists and so M is a maximal ideal. ■ 

Corollary 17.1.16 Let R be a commutative ring with 1 and a E R. Then a is 
in a maximal ideal of R if and only if a is not a unit. 

Proof. Suppose a is not a unit. Then (a) C R else 1 = ra for some r. 
By Theorem 17.1.15, there exists a maximal ideal M such that (a) C M. Now 
a E (a) C M. Conversely, suppose a E M, where M is a maximal ideal. If a is 
a unit, then 1 — a~ l a E M and so M = R, a contradiction. ■ 

Corollary 17.1.17 Let R be a commutative ring with 1 . Then R has a maxi¬ 
mal ideal. 


Proof. In R, {0} is a proper ideal. Hence, by Theorem 17.1.15, there exists 
a maximal ideal M of R such that {0} C M. ■ 

The fundamental theorem of arithmetic says that any integer n has a prime 
factorization n = p\ 1, ‘‘Ps s i where pi,...,p s are primes and ei,...,e s are 
positive integers. The ideals (pf) are prime ideals of Z. The ideals {p e f) are 
also special ideals of Z. Their study is motivated in part by the fundamental 
theorem of arithmetic. 


Definition 17.1.18 Let R be a commutative ring and Q be an ideal of R. Then 
Q is called a primary ideal if for all a, b E R, ab E Q and a Q implies that 
there exists a positive integer n such that b n E Q. 


From the definition of primary ideal, it follows immediately that every 
prime ideal in a commutative ring is a primary ideal. Now in the ring Z, for 
any prime integer p, the ideal { p n ) contains p n but not p, where n is a positive 
integer and n > 2. Hence, (p n ) is not a prime ideal. The following example 
shows that ( p n ) is a primary ideal. 
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Example 17.1.19 Letp be a prime in Z and n be a positive integer. We show 
that (p n ) is a prim,ary ideal. Let ab £ { p n ) and a £ ( p n ) . Then there exists 
r £ Z such that ab = rp n . Since p n does not divide a, p\b and so b = qp for 
some q £ Z. Thus, b n = q n p n and so b n £ { p n ) . 

Example 17.1.20 Letpix) be irreducible in F[x], F afield, and n be a positive 
integer. Then ( p{x) n ) is a primary ideal by an argument entirely similar to the 
one used in Example 17.1.19. 

Definition 17.1.21 Let R be a commutative ring and I be an ideal of R. Then 
the radical of I, denoted by \fl, is defined to be the set 

VI = {a £ R | a n £ I for some positive integer n}. 

Theorem 17.1.22 Let Q be an ideal of a commutative ring R. Then 

(i) VQ i s an ideal of R and VQ 5 Q, 

(ii) if Q is a prim.ary ideal, then VQ a prim,e ideal. 


Proof. (i) Clearly VQ 5 Q - Let a, b £ VQ- Then there exist positive 
integers n, m such that a n ,b m £ Q. Thus, (a — b) n+Tn £ Q and so a — b £ VQ- 
Let r £ R. Then (ra) n = r n a n £ Q and so ra 6 VQ- Hence, VQ an ideal of 
R. 

(ii) Let a, b £ R be such that ab £ VQ an d a VQ- There exists a positive 
integer n such that a n b n = ( ab) n £ Q. But a n £ Q. Since Q is primary, there 
exists a positive integer m such that b nTn = ( b n ) m £ Q. Therefore, b £ VQ and 
so VQ is prime. ■ 


Definition 17.1.23 Let Q be a prim,ary ideal of a commutative ring R. Then 
the radical P = VQ °f Q ? ' s called the associated prime ideal of Q and Q is 
called a primary ideal belonging to (or primary for) the prim.e ideal P. 


Example 17.1.24 Let i be a positive integer. In Z, we show tha,t (jp 1 ) is 
primary for (p) , where p is a prime. It suffices to show that (p) = yj(p l ). 

Let a £ yj(p l ). Then there exists a positive integer n such that a n £ (p z ) ■ 
Therefore, a n = rp 1 for some r £ Z. This implies that p\a and so a £ (p) . 
Hence, yj(p 2 ) C (p) . Let a £ (p) . Then there exists t £ Z such tha,t a = tp. 

This implies that a 1 = t l p l £ ( p 1 ) and so a £ yj ( p l ). Thus, (p) C yj(p l ). 

In F[x] (F a field), a similar argument shows that (jp{xY) is primary for 
ip(x)) , where p{x) is irreducible and (p(x)) = J (p(x) J ). 


Theorem 17.1.25 Let Q and P be ideals of a commutative ring R. Then 0 
is prim,a,ry and, P = VQ V an ^ ° n ly if 

(i) Q Q P C VQ and 

(ii) ab £ Q. a £ Q implies b £ P. 
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Proof. The necessity of (i) and (ii) is immediate. Suppose (i) and (ii) hold. 
Let ab E Q, a Q. Then 6 G P C y/Q and so there exists a positive integer 
n such that b n G Q , whence Q is primary. We now show that P — y/Q. Let 
b G Then there exists a positive integer n such that b n G Q C P. Let n be 
the smallest positive integer such that b n G Q. If n = 1 , then b G P. So assume 
that n >2. Then 66 n_1 G Q and b n ~ l £ Q implies that 6 G P. Hence, -y/Q Q P 
and so P = y/Q. ■ 

We now show that every primary ideal I of a commutative ring R can be 
characterized with the help of some properties of the quotient ring R/I. 

Theorem 17.1.26 Let R be a com,mutative ring and I be an ideal of R. Then 
I is a prim,ary ideal if and only if every zero divisor of R/I is nilpotent. 

Proof. First suppose that / is a primary ideal. Let a -t- 7 be a zero divisor 
in R/I. Then there exists an element b + I G R/I, b + I / I , such that 
(a + 7)(6 + 7) = I. Now ab G / and b £ I. Since 7 is a primary ideal, it follows 
that a n G I for some positive integer n. Hence, (a + 7) n — a n +1 = 7, showing 
that a + 7 is nilpotent. 

Conversely, suppose that every zero divisor of R/I is nilpotent. Let a, 6 G R 
be such that ab € I and a (£ I. Then a+ 7 / I. Now (a+ 7) ( 6 + 7) = a& + 7 — I. 
If b + 7 — I, then b G 7. Suppose b + 7 / I. This implies that 6 + 7 is a zero 
divisor and so is nilpotent. Therefore, there exists a positive integer n such 
that b n + 7 = (b + 7) n = 7. Thus, b n G 7. Consequently, 7 is a primary ideal. ■ 

Consider Z. For the prime factorization of an integer n, n = pf 1 ■ ■ -p /, we 
have 

( n ) = (pi 1 ) ■ ■ ■ (p e s s ) = ( P e f) n ■ • • n {; p e s s ) 

and yj(pf) = (pi) , i = 1, 2,..., s. However, in the polynomial ring Z[x, y\ , it 

can be shown that the ideal (x 2 , xy , 2 ) is an intersection of primary ideals, but 
not a product of primary ideals. These concepts involving prime and primary 
ideals are used in the study of nonlinear equations. For example, consider the 
following nonlinear equations: 


x 2 — y — 0 

x 2 z — 0. 


In the polynomial ring R[x,y], let 7 — (x 2 —y,x 2 z). It can be shown that 
(x 2 — y, z) and (x 2 , y) are primary ideals and that 7 = (x 2 , y ) O (x 2 — y,z) . In 
fact, it can be shown in any polynomial ring F[x\,..., x n ] over a field F that 
every ideal is a finite intersection of primary ideals. This latter result is a type 
of fundamental theorem of arithmetic for ideals. It can also be shown that 
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y/(x 2 — y , z) = (x 2 — y, z) and y/Jx^y) = (x,y) . The solution to the above 
system of equations is 

{(x, x 2 , 0) | x G R} U {(0, 0, z) | z E R}. 

The ideal (x 2 — y,z ) corresponds to {(x,x 2 ,0) | x E i?}, while the ideal (x,y) 
corresponds to {(0,0, z) | z E R}. 

We conclude this section by mentioning the following differences between 
the ideals of Z and Z[x]. 

1. In the ring Z, every ideal is a principal ideal, but in Z[x] there exist ideals 
(for example, (x, 2 )), which are not principal. 

2 . In the ring Z, a nontrivial ideal is a prime ideal if and only if it is a 
maximal ideal. In the ring Z[x], there are prime ideals (for example (x)), 
which are not maximal. 

3. In the ring Z, a nontrivial ideal I is a primary ideal if and only if I = 
(p n ) for some prime p and for some positive integer n. Hence, in Z, if I 
is a primary ideal, then I is expressible as some power of its associated 
prime ideal. In Z[x], this is not true, as (x,4) is a primary ideal with 
(x,2) as its associated prime ideal, but (x,4) ^ (x,2) n for any n > 1. 


17.1.1 Worked-Out Exercises 

0 Exercise 1 Let R be an integral domain. Prove that if every ideal of R is 
a prime ideal, then R is a field. 

Solution: Let 0 ^ a E R. Then o?R is an ideal of R and hence it is a 
prime ideal. Now a 2 E a 2 R. Since a 2 R is a prime ideal, a € a 2 R. Thus, a — a 2 b 
for some b E R. Then a(l — ab ) = 0. Since R is an integral domain and a/0. 
1 — ab — 0 and so a 6 = 1, proving that a is a unit. Hence, R is a field. 

0 Exercise 2 Let R be a commutative ring with 1 . Suppose that (x) is a 
prime ideal of R[x]. Show that R is an integral domain. 

Solution: Since (x) is a prime ideal R[x)/ (x) is an integral domain. Since 
R[x\/ (x) ~ R, R is an integral domain. 

Exercise 3 Let R be a commutative ring and I be an ideal of R. Let P be a 
prime ideal of I. Show that P is an ideal of R. 


Solution: Let a E P C I and r E R. Then rar E I. Therefore, a(rar ) E P 
and so ( ar ) 2 E P. Since P is a prime ideal of /, ar E P. Hence, P is an ideal of 
R. 
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Exercise 4 Show that a proper ideal I of a ring R is a maximal ideal if and 
only if for any ideal A of R either ACIotA + I = R. 

Solution: Suppose I is a maximal ideal of R and let A be any ideal of R. 
If A $7 7, then A + I is an ideal of R such that I C A A I. Since I is maximal, 
it follows that A + I = R. 

Conversely, assume that the proper ideal I satisfies the given condition. 
Let J be an ideal of R such that I C J. Now J <2 I- Therefore, I -f- J = R. But 
I + J — J. Thus, J = R. Hence, 7 is a maximal ideal of R. 

>0 Exercise 5 Let R be a PID which is not a field. Prove that any nontriv¬ 
ial ideal I of R is a maximal ideal if and only if it is generated by an 
irreducible element. 

Solution: Since R is not a field, there exists an element 0 ^ a E R such 
that a is not a unit. Then (0) C (a) C R. Therefore, (0) is not a maximal 
ideal. Let I be a maximal ideal of R. Then 7 7 ^ {0} and I — ( p ) for some p E 
R, where p is irreducible by Theorem 17.1.5 and Corollary 15.3.13. Conversely, 
let I = (p) and p be irreducible. Let 7 C J C R. Since R is a PID, J — (a) for 
some a E R. Since p E (a) , a divides p. Thus, p = ab for some b E R. Since p is 
irreducible, either a is a unit or 6 is a unit. If b is a unit, then a = pb~~ l E (p) . 
Thus, J C 7, which is a contradiction. Hence, a is a unit and so J — R. Thus, 
7 is a maximal ideal. 

0 Exercise 6 Show that the ideal { x) in Z [ 2 ] is a prime ideal, but not a 
maximal ideal. 

Solution: Let f(x) = ao-t-aicr-H • • + a n x n and g{x ) = bo + b\x-\ -h bmP 71 

be two elements in 7i[x] such that f{x)g{x) E (x) . Then ao&o = 0. Thus, either 
a 0 = 0 or bo = 0. Hence, either f{x) E (x) or g{x) E (x ), showing that (x) 
is a prime ideal. Now the ideal (x,2) of Z[x] is such that (x) C {x,2) C Z[x\. 
Hence, ( x ) is not a maximal ideal. 

<0 Exercise 7 Let R be a commutative ring with 1. Let A and B be two 
distinct maximal ideals of R. Show that AB = A fl B. 


Solution: Since AB C A and AB C B, AB C A fl B. Since A and 
B are distinct maximal ideals, there exists b E B such that b A. Then 
(A, b) = {a -I- 6r | a E A, r E R} is an ideal of R such that A C { A , b ). Since 
A is maximal, { A , b) = R. This implies that 1 = a + br for some a E A and 
r E R. Let x E A fl B. Then x = xl = xa 4 - xbr = xa 4 - ( xb)r E AB. Hence, 
A n B C AB. Thus, AB — An B. 


0 Exercise 8 Let f{x) = x 5 + 12a: 4 + 9a: 2 4- 6. Show that the ideal 7 = (/( x)) 
is maximal in Z[a:]. 
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Solution: I will be a maximal ideal if we can prove that f(x) is an irre¬ 
ducible polynomial in Z[x]. The content of f(x) is 1. Hence, f(x) is a primitive 
polynomial in Z[x\. Also, for the prime 3, we find that 3|6, 3]9, 3| 12, 3 / 1, 3 2 
/ 6 . Hence, f(x) is irreducible in Z[x], by Eisenstein’s criterion. 

0 Exercise 9 (i) Find all maximal ideals of the ring Zq. 

(ii) Find all ideals and all maximal ideals of the ring Z 8 . 

Solution: (i) The mapping (3 : Z —> Zq defined by (3(n) = [n] is a homo¬ 
morphism of Z onto Ze and Ker (3 — 6 Z. If / is any ideal of Zq, then there exists 
a unique ideal A of Z such that Ker [3 C A and (3(A) = I. Now Z, 2Z, 3Z, and 
6 Z are the only ideals of Z which contain 6 Z. Also, (3( Z) = Zq, (3( 2Z) = {[0], 

[2] , Mb £(3Z) = {[0], [3]}, and /?( 6 Z) = {[ 0 ]}. Hence, {[0], [2], [4]} and {[0], 

[3] } are the only maximal ideals of Z 6 since 2Z and 3Z are maximal ideals of 
Z. 

(ii) The mapping (3 : Z —» Z 8 defined by (3(n ) = [n] is an epimorphism of 
rings and Ker (3 — 8Z. Now Z, 2Z, 4Z, and 8Z are the only ideals of Z which 
contain 8Z. Also, (3( Z) = Z 8 , >0(2Z) = {[0], [2], [4], [ 6 ]}, (3(AZ) = {[0], [4]}, and 
(3(8 Z) = {[0]}. Hence, the ideals of Z 8 are Z 8 , {[0], [2], [4], [ 6 ]}, {[0], [4]}, and 
{[0]}. Now {[0]} C {[0], [4] } C {[0], [2], [4], [ 6 ]} C Z 8 . This implies that Z 8 has 
only one maximal ideal, which is {[0], [ 2 ], [4], [ 6 ]}. 

0 Exercise 10 Show that ( x 2 ) is a primary ideal in Z[x] with (x) as its asso¬ 
ciated prime ideal. 

Solution: Let f(x) = ao + ai:r4-- • • + a n x n and g(x) = bo~\-bix-\-- • ■ + b rn x rn 
be two elements in Z[x] such that f(x)g(x) G (x 2 ) and f(x ) ^ (x 2 ) . Then 
f(x)g(x) = x 2 h(x) for some h(x ) G Z[x\. Hence, ao&o — 0 and aob\ + ui&o — 0. 
Since f(x ) ^ (a; 2 ) , it follows that either ao ^ 0 or aj ^ 0. If ao 7 ^ 0, then 
bo = 0 and b\ = 0 and so g(x) € (x 2 ) . If ao = 0, then a\ ^ 0- Hence, 
ao^i + ai 6 o = 0 shows that bo = 0. So we find that 6 q = 0, bob\ + 6160 = 0 
and thus (g(x)) 2 G (x 2 ) . Hence, ( x 2 ) is a primary ideal. Now (x 2 ) C (x) 
and f(x) G \/(x 2 ) if and only if ( f(x)) n G (x 2 ) for some positive integer n. 
This is true if and only if the constant term of f(x) is zero, i.e., if and only if 
f(x) G (x) . 

Exercise 11 Show that a commutative ring R with 1 is isomorphic to a subdi¬ 
rect sum of a family of fields if and only if the intersection of all maximal 
ideals of R is {0}. 


Solution: Suppose R is isomorphic to a subdirect sum of a family of fields 
{F t | i G /}. Then there exists a subring T of such that T = ® s i&I Fi 

and R ~ T. Let a : R — ► T be an isomorphism. Then 7T{ o a : R — ► Fi is an 
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epimorphism for all i E I, where 7 q is the ith canonical projection. Proceeding 
as in the proof of Theorem 13.1.14, we can show that 

PiZlAi = { 0 }, 

where A* ~Ker tti o a for all i € I. Now R/Ai ~ Fi. Since F t is a field, Ai is a 
maximal ideal for all i G /. If A is the intersection of all maximal ideals of R, 
then A C r\i£iA t — {0}. Hence, A = {0}. Conversely, suppose that A = {0}, 
where A = | Mi is a maximal ideal of R}. By Theorem 13.1.14, R is 

monomorphic to the subdirect sum of a family of rings {R/Mi \ i £ J}. Since 
each Mi is a maximal ideal, we find that R/Mi is a field. 

17.1.2 Exercises 

1 . Find all maximal and prime ideals of Zio- 

2 . Prove that / — {(5ra,m) | n,m £ Z} is a maximal ideal of Z x Z. 

3. Find all ideals and maximal ideals of Z k, where p is a prime and A: is a 
positive integer. 

4. Let I = {ao + a\x + • • • + a n x n £ Z[x\ \ 3 divides ao}. Show that I is a 
prime ideal of Z[x\. Is / a maximal ideal? 

5. Let I be an ideal of a ring R. Prove that the following conditions are 
equivalent. 

(i) / is a prime ideal. 

(ii) If a, 6 € R\I, then there exists c £ R such that acb € R\I. 

6 . Let R be a finite commutative ring with 1. Show that in R, every prime 
ideal I ^ R is a maximal ideal. 

7. Let R be a Boolean ring. Prove that a nonzero proper ideal I of R is a 
prime ideal if and only if it is a maximal ideal. ' 

8 . Let R be a ring with 1. Prove that a nonzero proper ideal I of R is a 
maximal ideal if and only if the quotient ring R/I is a simple ring. 

9. Let I be an ideal of a ring R. If P is a prime ideal of the quotient ring 
R/I, prove that there exists a prime ideal J of R such that / C J and 
J/I = P. 

10. Let R be a commutative ring with 1. Prove that there exists an epimor¬ 
phism from R onto some field. 

11 . Let I be an ideal of a ring R with 1. Prove that the quotient ring R/I is 
a division ring if and only if I is a maximal ideal. 
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12. For all r G R, show that I r = {/(x) £ R[x] | /(r) = 0} is a maximal 
ideal of R[x] and R [x]/I r — R. Also, prove that n r eR./ r = {0}. 

13. Consider the polynomial ring K[x\ over a field K. Let a G K. Define the 
mapping 0 a : K[x\ —> K by </> a (/(x)) = /(a) for all f(x) G K[x}. Show 
that <p a is an epimorphism and Ker <p a is a maximal ideal of K[x\. 

14. Let R be a PID. 

(i) Prove that every nonzero nonunit element is divisible by a prime 
element. 

(ii) If {/ n } n6 N is a sequence of ideals of R such that /j C C • • ■ C J n C 
• ■ •, prove that there exists a positive integer n such that I n — I n+ i = - - •. 

(iii) Prove that every nonzero nonunit can be expressed as a finite product 
of prime elements. 

15. Let {/ a } be a collection of prime ideals in a commutative ring R such 
that {I a } forms a chain. Prove that C\ a I a and U Q I a are prime ideals of 
R. 

16. If I\ and I 2 are ideals of a commutative ring 1, prove that \fT\ Pi I 2 = y/T[ 

17. Let R be a commutative ring with 1 and Qi , i = 1,2be ideals 
in R. Set Q = n™ =l Qi. Prove that if y/Qi = P for some ideal P of R , 
i = 1,2,..., n, then y/Q — P. If \fQ% = P, i = 1, 2,...,n, and each Qi is 
primary, prove that Q is primary. 

18. If I is an ideal of a commutative ring R with 1 such that \fl is a maximal 
ideal, prove that I is a primary ideal. 

19. In the polynomial ring Z[x], prove the following. 

(i) I = {f(x) G Z[x] | the constant term of f(x) is divisible by 4} is a 
primary ideal with J = (x,2) as its associated prime ideal. 

(ii) The ideal (x, 6) is not a primary ideal. 

20. Prove that every prime ideal is a primary ideal in a commutative ring. 

21 . Let M be an ideal of a commutative ring R. Prove that R/M is a field 
if and only if M is a maximal ideal and x 2 G M implies x G M for all 
x G R. 


22. Prove that in a PID every nontrivial ideal / can be expressed as a fi¬ 
nite product of prime ideals I = P\ • • • P n such that Pi, P 2 , ..., P n are 
determined uniquely up to order. 
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23. An ideal P of a ring R is called a semiprime ideal if for any ideal I of 
R, I 2 C P implies that / CP. 

(i) Prove that an ideal P of R is a semiprime ideal if and only if the 
quotient ring RfP contains no nonzero nilpotent ideals. 

(ii) If R is a commutative ring with 1, prove that an ideal P of R is a 
semiprime ideal if and only if y/P = P. 

24. A commutative ring R with 1 is called a local ring if R has only one 
maximal ideal. Prove the following. 

(i) Zg and Z 9 are local rings. 

(ii) In a local ring, all nonunits form a maximal ideal. 

(iii) In a local ring R , for all r, s € R, r + s — 1 implies either r is a unit 
or s is a unit. 

25. Let p be a prime integer and Q p = {| 6 Q | p does not divide 5}. Show 
that Q p is a local ring under the usual addition and multiplication of 
rational numbers. 

26. Let R be a field and T be the set of all sequences {a n } of elements of 
R. Then (T, +, •) is a ring, where + and • are defined as in Worked-Out 
Exercise 6 (page 358). Prove the following. 

(i) The set I of all nonunits of T is a maximal ideal of T. 

(ii) I is the only maximal ideal of T. 

(iii) T is a local ring. 

27. Let R = R\ 0 R 2 © • • • © Rn be the direct sum of the finite family of 
rings {Ri, R 2 ,..., Rn}, where each Ri contains an identity. Prove the 
following: 

(i) If Mi is a maximal ideal of Ri (1 < i < n), then R\@ R 2 © • • • © Ri-i 
0 Mi 0 Ri+i 0 • • - ®Rn is a maximal ideal of R. 

(ii) Every maximal ideal M of R is of the form 

R\ 0 0 • • • 0 Ri— 1 0 Mi 0 Ri+l 0 • * • 0 Rn, 

where Mi is a maximal ideal of Ri for some i (1 < i < n). 

28. Show that the ring Z is isomorphic to a subdirect sum of a family of 
fields. 


29. 


An ideal / of a ring R is called a minimal ideal if / ^ {0} and there does 
not exist any ideal J of R such that {0} 7 ^ J C I. If I is a minimal ideal 
of a commutative ring R with 1, prove that either 1 2 = {0} or I = eR 
for some idempotent e £ R. 
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30. In the following exercises, write the proof if the statement is true; other¬ 
wise, give a counterexample. 

(i) Let R be a commutative ring with 1 and P be a prime ideal of R 
such that P 7 ^ R. If the quotient ring R/P contains a finite number of 
elements, then R/P is a field. 

(ii) In a PID, there exists a prime element. 

(iii) In a PID, every proper prime ideal is a maximal prime ideal. 

(iv) The intersection of two prime ideals of a ring R is a prime ideal of 
R. 

(v) If I is a prime ideal of a ring R , then I[x] is also a prime ideal of #[ 2 ]. 

(vi) If 7 is a maximal ideal of a ring R, then I[x] is also a maximal ideal 
of R[x}. 

(vii) A commutative ring with 1 and with only a finite number of maximal 
ideals is a field. 

(viii) In the ring Z, the ideal (5) is a maximal ideal, but in the ring Z[i], 
the ideal (5) is not a maximal ideal. 


17.2 Jacobson Semisimple Ring 

In this section, we introduce an interesting class of commutative rings and give 
a simple characterization of this class. 

Throughout the section, we assume that R is a commutative ring with 1. 

Definition 17.2.1 The Jacobson radical of a ring R, denoted by radR , is 
the set 

radR = 0{M | M is a m,axim,al ideal of R}. 

Since the ring R contains 1, maximal ideals in R exist and thus rad/2 is 
well defined. 

The following theorem gives a characterization of rad R with the help of 
elements of radiL , 

Theorem 17.2.2 Let y £ R. Then y E radR if and only if 1 — xy is a unit in 
R for all x E R. 


Proof. Suppose y E rad R and there exists an element x E R such that 
1 — xy is not a unit. Then from the Corollary 17.1.16, there exists a maximal 
ideal M of R such that 1 — xyE M. Since y E rad R, y E M. Therefore, xy E M. 
This implies that 1 = 1 —xy + xyE M, which is a contradiction. Hence, 1 — xy 
is a unit. 
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Conversely, assume that 1 — xy is a unit in R for all x £ R. Suppose y ^ 
radii. Then there exists a maximal ideal M of R such that y M. Consider 
(M,y) = {m + ry \ m £ M, r £ R}, the ideal generated by M U {y}. Clearly 
M C (M, y) . Since M is a maximal ideal, it follows that (M, y) = R. Hence, 
1 = m + ry for some m £ M and r £ R. Thus, 1 — ry = m £ M. By the 
hypothesis, m = 1 — ry is a unit. Hence, 1 = mm,~ l £ M. This implies that 
M — R, which is a contradiction. Hence, y £ radii. ■ 

Corollary 17.2.3 0 is the only idempotent element in radR. 

Proof. Let y £ radii and y be an idempotent. Now 1 — y — 1 — ly is a 
unit. Hence, there exists u £ R such that (1 — y)u = 1. Then y = y(l — y)u = 
(y - y 2 )u = (y - y)u = Ou. = o. ■ 

Corollary 17.2.4 radR contains all nil ideals of R. 

Proof. Let I be a nil ideal of R. Now every element of i is nilpotent. Hence, 
for all a £ i, r £ R, ar is nilpotent. This implies that 1 + ar is a unit for all 
r £ R by Exercise 11 (page 282). Hence, a £ radii. Thus, i C radii. ■ 

Theorem 17.2.5 radR is an ideal of R and rad(R/radR) = {0}, where 0 — 
0+radii. 

Proof. Since the intersection of a family of ideals of R is an ideal, radii is an 
ideal of R. Denote I — radii. Now i ^ R and R/I is a commutative ring with 
identity. Let a+i £ rad (R/I) and x+I £ R/I. Then 1 +i—(i+i)(a+i) is a unit 
in R/I. Thus, there exists u+I £ R/I such that (1 + i — (x+I)(a+I))(u+I) = 
1 + I or (1 — xa + I)(u + I) — 1 + i. Hence, 1 — (1 — xa)u £ I and so by The¬ 
orem 17.2.2, 1 — (1 — (1 — xa)u) is a unit in R, i.e., (1 — xa)u is a unit in R. 
This implies that 1 — xa is a unit in R for all x £ R. Therefore, a £ I and so 
0 + 1 = 1. Consequently, rad(ii/radii) = {0}. ■ 

Let us now consider those commutative rings with 1 for which radi? = {0}. 

Definition 17.2.6 R is called a Jacobson semisimple ring (J-semisimple 
ring) if radR = {0}. 

Example 17.2.7 (i) For any ring R , the quotient ring R/radR is J~semisim,ple 
by Theorem, 17.2.5. 

(ii) The ring Z of integers is J-semisimple. In Z, the m,axim,al ideals are 
of the form (p ), where p is a prim,e. Let n £ radZ. Then p\n for all p £ Z, p 
is prim.e. Since Z is a UFD, n = 0. Hence, radZi = {0). . __ 
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(in) Every commutative regular ring R is J-semisimple. Let a E radR. Then 
there exists b E R such tha.t a = aba. Now ab E radR and ( ab ) 2 = abab — ab. 
By Corollary 17.2.3, ab = 0. Hence, a — aba — 0. Thus, radR = {0}. 

(iv) Every field is J-semisimple. 

(v) Every polynomial ring F[x ] over a fi.eld F is J-semisimple. 

Theorem 17.2.8 A commutative ring R with 1 is J-semisimple if and only if 
it is isomorphic to a subdirect sum of a family of fields. 

Proof. Suppose R is J-semisimple. Let {Mi j i E 1} be the collection of all 
maximal ideals of R. Then rad R = n^jMi = {0}. Hence, by Theorem 13.1.14, 
R is isomorphic to a subdirect sum of a family {R/Mi \ i E 1} of rings. But 
RjMi is a field for all i E I. Hence, R is isomorphic to a subdirect sum of a 
family of fields. Conversely, suppose that R is isomorphic to a subdirect sum of 
a family of fields {Fi | i E /}. Then there exists a family of ideals Mi such that 
each F{ is isomorphic to R/Mi and D i^iMi = {0}. Since each Mi is a maximal 
ideal of R and D 1 ^jM 1 = {0}, it follows that radi? C = {0}. Thus, R is 

J-semisimple. ■ 


17.2.1 Worked-Out Exercises 

0 Exercise 1 Find radZi 2 - Is the ring Z 12 a J-semisimple ring? 

Solution: The mapping /5 : Z —> Z 12 defined by f3{n) = [n] is an epi- 
morphism of rings and Ker (3 = 12Z. Now Ker (3 is contained in the ideals, 
Z, 2 Z, 3Z, 4Z, 6 Z and 12 Z. Also, (3{ Z) = Z 12 , (3( 2Z) = {[ 0 ], [2], [4], [ 6 ], [8], 
[10]}, 13(32) = {[0], [3], [ 6 ], [9]}, /3(4Z) = {[0], [4], [ 8 ]}, /?( 6 Z) = {[0], [ 6 ]}, and 
K 12Z) = {[0]}. Hence, I = {[0], [2], [4], [ 6 ], [ 8 ], [10]} and J = {[0], [3], [ 6 ], 
[9]} are the only maximal ideals of Z 12 . Now radZi 2 = I C\ J = {[0], [ 6 ]}. Since 
radZi 2 ^ {[0]}, Z 12 is not J-semisimple. 

<) Exercise 2 Is the ring Z 15 a J-semisimple ring? 


Solution: Proceeding as in Worked-Out Exercise 1, we can show that 
I = {[0], [3], [ 6 ], [9], [12]}, J = {[0], [5], [10]} are the only maximal ideals of 
Z 15 . Now radZis = / fl J = {[0]}. Hence, Z 15 J-semisimple. 


Exercise 3 Let R be a commutative ring with 1 and A be an ideal of R such 
that A C radi?. Show that rad (R/A) = rad R/A. 


Solution: Let b+A E rad(i?/A). Let r E R. Then (1-f-A) — (b+A)(r+A) is 
a unit. Hence, there exists d+A E R/A such that ((1 — br) +A)(d + A) = 1 -\-A. 
This implies that 1 — [d — dbr) E A C radJ?. Hence, 1 — (1 — {d — dbr )) is a unit in 


R, i.e., d( 1 — br) is a unit in R. Thus, 1 — br is a unit in R for all r E R. Hence, 
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b 6 racLR, and so b + A € rad R/A. Thus, rad(.R/A) C radR/A. Now let 6 +A € 
radR/A, where b <G radR. Then, 1 — be is a unit for all c € R. Let c E R. Now 
there exists d E R such that (1 — bc)d = 1. Thus, ((1 — be) + A)(d + A) — 1 + A 
in .R/A, i.e., (1 + A) — (6 + A)(c + A) is a unit in R/A. Hence, b + A E rad(R/A). 
Thus, radR/A C rad(R/A). Hence, rad(R/A) = rad R/A. 

17.2.2 Exercises 

1. Prove that the ring Z n , n > 1 , is J-semisimple if and only if n is a square 
free integer. 

2. Is the ring Zio a J-semisimple ring? 

3. Let R be a PID. If R has an infinite number of maximal ideals, prove 
that R is J-semisimple. 

4. Let R — R\ x R 2 be the direct product of two commutative rings R\ and 
R 2 with 1. Prove that radR = radRi x radR 2 . 

5. Let R(R) = {/ | / : R —> R}. R(R) is a commutative ring with 1 , where 
+ and • are defined by 

if+ 9 ) 0) = f(x) + g(x) 

(/ ' g)i x ) = f{x)g(x) 

for all /, g 6 ^(R) and for all x G R. Let t G R. 

(i) Show that /* = {/€ JF(R) | f(t) = 0} is a maximal ideal and 
EteB.lt — { 0 }- 

(ii) Prove that ^P(R) is a J-semisimple ring. 

6 . Which of the following statements are true? Justify your answer. 

(i) If Fi and F 2 are two fields, then F\ x F 2 is a J-semisimple ring, but 
not a field. 

(ii) If R\ and R 2 are two J-semisimple rings, then x R 2 is a J- 
semisimple ring. 

(iii) A J-semisimple ring may contain a nonzero nil ideal. 

(iv) In a commutative ring R with 1 , for any two ideals A and B , AB = 
{0} may not imply AnB = {0}, but in a J-semisimple ring this is always 
true. 





Chapter 18 

Noetherian and Artinian 
Rings 


In Hilbert’s work on invariant theory is the result that in certain polynomial 
rings, every ideal is finitely generated. Lasker, a student of Hilbert and a former 
world chess champion, showed that in certain polynomial rings, every ideal is 
a finite intersection of primary ideals. Noether generalized Lasker’s result to 
commutative rings in which any strictly ascending chain of ideals is finite. 

18.1 Noetherian and Artinian Rings 

In the present section, we introduce two special classes of rings—Noetherian 
rings and Artinian rings. Noetherian rings satisfy an ascending chain condition 
of ideals, whereas Artinian rings satisfy a descending chain condition of ideals. 
We first define these two properties of ideals. 

Definition 18.1.1 A ring R is said to satisfy the ascending chain con¬ 
dition ( ACC) for left (right) ideals if for each sequence of left (right) ideals 
Ai, A2, ... of R with A\ C A2 C ■ • •, there exists a positive integer n (depending 
on the sequence ) such that A n — A n +1 — • • •. R is said to satisfy the descend¬ 
ing chain condition (DCO) for left (right) ideals if for each sequence of left 
(right) ideals Aj, A2, ... of R with A\ D A2 2 ■ • ■, there exists a positive integer 
n (depending on the sequence) such that A n — A n + 1 = • • • . 

Clearly the ACC on left ideals- is equivalent to the statement that any 
sequence of left ideals Ai, A 2 ,... of R such that A\ C A 2 C ■ ■ ■ must be finite. 
A similar equivalence holds for the DCC. 

Let X be a nonempty set of left ideals of a ring R. Then (. X , <) is a partially 
ordered set, where < is defined by the set inclusion relation. This partially 
ordered set may have a maximal element, i.e., there may exist an element 
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A € I such that A is not contained in any other element of X. Also, this 
partially ordered set may contain a minimal element, i.e., an element of X that 
does not contain any other element of X. Considering all these conditions, let 
us introduce the following conditions on a ring. 

Definition 18.1.2 A ring R is said to satisfy the maximal condition (MC) 
for left (fi.ght) ideals if in any nonempty set of left (right) ideals of R, there 
exist som.e left (right) ideal which is maximal in the set, i.e., not contained in 
any other ideal of the set. R is said to satisfy the minimal condition (mCT) 
for left (right) ideals if in any nonempty set of left (right) ideals of R , there 
exist som.e left (right) ideal which is minimal in the set, i.e., does not contain 
any other left, (right) ideal of the set. 

Example 18.1.3 The ring 7i of integers satisfies the m,axim,al condition for 
ideals, but does not satisfy the minim,al condition for ideals. Let X be a,ny 
nonempty collection of ideals ofX. Let A\ E X. Then there exists a nonnegative 
integer n such that A\ = (n) . If A\ is not m.axim.al, then there exists an ideal 
A 2 — (m) such that A\ C A 2 . Then m ft n and m, divides n. Again, if A 2 
is not m.axim.al, then there exists an ideal A 3 = (r) such that A 2 C A 3 . Then 
r A m, r A n, and, r divides m, and, n. If A 3 is not maximal, then we repeat 
this process. Since Z is a UFD, n has finitely many distinct divisors. Hence, 
the above process m,ust terminate after finitely m,a,ny steps. Thus, X contains a 
m,a,ximal element. Consider the set J = {mZ | m in a positive even integer} of 
ideals of Z. For any mZ E J 2mZ E J , and mZ D 2mZ. Therefore, it follows 
that J has no minimal elem.ent. 

Theorem 18.1.4 In any ring R, the following conditions are equivalent. 

(i) R satisfies the ACC for left ideals. 

(a) R satisfies the MC for left ideals. 

(Hi) Every left ideal of R is finitely genera,ted. 


Proof. (i)=^(ii): Let A be any collection of left ideals of R. Let A\ E A. 
Then A\ is either maximal in A or there exists a left ideal A 2 E A such that 
Ai C A 2 . If Ai is maximal in A, then we have proved the assertion. If A\ is 
not maximal in A, then either A 2 is maximal in A or there exists A 3 E A such 
that A 2 C A 3 . By the ACC, this process must terminate in a finite number of 
steps, say, n steps. Then A n is maximal in A. 

(ii)=>(iii): Let A be any left ideal of R. Let ai E A. Then either (a\) l = A 
or (a\) l C A. If (a\) l = A, then A is finitely generated. Suppose (ai)/ C A. 
Let a 2 E A and 02 ^ (ai)/. Then (a\) l C ( 01 , 02 )/ and either ( 01 , 02 )/ = A or 
( 01 , 02 )/ C A. If ( 01 , 02 )/ = A, then A is finitely generated. If ( 01 , 02 )/ C A, 
then we continue this process. If in a finite number of steps, say, n, we obtain 





18.1. NOETHERIAN AND ARTINIAN RINGS 


408 


(ai, < 22 ,..., a n )i — A, then A is finitely generated. If this is not the case, then 
there exist elements a. 2 ,... in A such that 


(ai) f C (< 2 i, (22); C • • • C (<21,02,..., a n ) l C • • •. 

In this case the set A = {(01)/, (&i, &2/), • • •, {<21, 0,2, ..., a n ) { ,...} is a col¬ 
lection of left ideals of R which does not have a maximal element. However, 
this contradicts our assumption that R satisfies (ii). 

(iii)=^(i): Let A\, A 2 , ... be any sequence of left ideals of R such that 


Ai C A 2 C • ■ •. 


Then A = Uis a left ideal of R and is finitely generated, say, A = 
(ai, a 2 ,..., a n ) l . Now a,j E Ai- for some Ai j , j = 1, 2,..., n. Let k be the 
maximum of ..., i n . Then 01 , 02 , ..., a n E Ak. This implies that A = 
(ai, 02 ,..., a n ) l C Ak C A. Hence, we must have that A — Ak- Thus, for any 
positive integer i, Ak+\ 3 A^ — A D A^i- Consequently, 

A\ C A 2 C • • • C Ak — Ak+\ = • • • = A. ■ 

Corollary 18.1.5 ^4ny principal ideal ring satisfies the ACC. ■ 

Corollary 18.1.5 provides us with many examples of rings satisfying the 
ACC. For instance, Z and the polynomial ring F[x\ over a field F satisfy the 
ACC since they are principal ideal rings. 

Definition 18.1.6 A ring which satisfies the ACC for left (right) ideals is 
called a left (right) Noetherian ring. 

A ring which is both left Noetherian and right Noetherian is called a 

Noetherian ring. 

The following theorem follows from Theorem 18.1.4. 

Theorem 18.1.7 In any ring R , the following conditions are equivalent. 

(i) R is a left Noetherian ring. 

(n) R satisfies the MC for left ideals. 

(Hi) Every left ideal of R is finitely genera,ted. M 

Example 18.1.8 (i) A principal ideal ring is a Noetherian ring. 

(ii) A polynomial ring over a field is a Noetherian ring. 

We now study the homomorphic images, quotients, and finite direct sums 
of Noetherian rings. 

Theorem 18.1.9 If R is a left, Noetherian ring, then any homom,orphic im,a,ge 
of R is a left Noetherian ring. 
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Proof. Let R be a left Noetherian ring and / : R —» S be an epimorphism 
of rings. Let 

Ji C J 2 C ■ ■ • 

be any ascending chain of left ideals of S. Let R = f~ l (Jk ) f° r all k > X. Then 
Ik is a left ideal of R for all k and I\ C I2 C • ■ ■. Since R is left Noetherian, there 
exists a positive, integer n such that I n — I n + t for alH > 1. Let y £ J n +i , i > X. 
Since / Is onto, there exists x £ R such that f{x) — y. Then x £ I n +i = I n 
and so y £ J n . Therefore, J n = J n +i for all i > 1 , proving that S is left 
Noetherian. ■ 

Theorem 18.1.10 Let I be an ideal of a ring R. If I and R/I are both left 
Noetherian rings, then R is left Noetherian. 

Proof. Let A\ C A 2 C ■ • ■ be an ascending chain of left ideals in R. Let 
0 : R —> R/I be the natural homomorphism of R onto R/I. Then 0(Ai) C 
0(^4.2) Q • • • is an ascending chain of left ideals in R/I. Since R/I is left 
Noetherian, there exists a positive integer n such that 0(A n ) = 0(A n +*) for 
all i > X. Also, A\ D / C A 2 H I C • ■ • is an ascending chain of left ideals 
in I. Since I is left Noetherian, there exists a positive integer m such that 
Am H I — Am+i H / for alM > 1. Let k be the larger of m and n. Then 
'ip(Ak) = if(A k +i) and A k n I - A k+ i fl I for all i > 1. Let b £ Ak+i • There 
exists x £ Ak such that 0(6) = 0(x), i.e., 6 + / = x + 1. Therefore, b — x £ I 
and also b — x £ Ak+i • This implies that b — x £ Ak+i fl I = Ak D I. Hence, 
b — x £ Ak and so 6 € Ak- Thus, Ak = Ak+i f° r Ail i > 1. Consequently, R is 
left Noetherian. ■ 

Theorem 18.1.11 A finite direct sum, of left Noetherian rings is left Noethe¬ 
rian. 


Proof. Let R = #1 © R 2 ® ■ • • © Rn be a finite direct sum of left Noethe¬ 
rian rings. We show the result for n = 2. The general case will follow by 
induction. Let R — R\ ® i? 2 , where R\ and R 2 are left Noetherian. Now 
(Ri£)R 2 )/Ri — R 2 ■ Thus, (Ri(BR2)/Ri is left Noetherian. Since (Ri®R 2 )/Ri 
and R\ are left Noetherian, R\ © R 2 is left Noetherian by Theorem 18.1.10. ■ 


Note: All the results which are established for left Noetherian rings can also 
be proved for right Noetherian rings by simply replacing left ideals with 
right ideals. 


Since every ideal of Z is a principal ideal, Z is Noetherian. Every ideal of Z is 
generated by a single element, but the ideals of Z[x\ may not be principal ideals, 
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i.e., may not be generated by a single element. Interestingly, the ideals of Z[x] 
are finitely generated. Thus, Z[x\ is not a principal ideal ring, but nevertheless 
is a Noetherian ring. This result follows from the following theorem. 

Theorem 18.1.12 If R is a commutative Noetherian ring with 1, then the 
polynomial ring .R[x] is a Noetherian ring. 

Proof. We show that every ideal A of i?[i] is finitely generated. For each 
integer n > 0, let I n be the set of all a G R such that either a = 0 or a is the 
coefficient of x n of a polynomial f{x) £ A of degree n. Suppose a,b £ I n and 
a ^ 0, b 7 ^ 0. Then there exist f(x),g(x) £ A such that deg f(x) = deg g(x) = n 
and a is the coefficient of x n in f(x) and b is the coefficient of x n in g{x). If 
a — b = 0, then a — b £ I n • Assume a - 6 / 0. Now a — b is the coefficient 
of x n of f{x) — g{x) £ A. Therefore, a — b E I n . Also, for r e R, if ra 0, 
then ra is the coefficient of x n of rf(x ) G A and so ra G I n ■ Hence, I n is 
an ideal of R for n > 0. We now show that I n C J n+1 . Let a be a nonzero 

element of I n . There exists f(x) = ao + a\x H-+ a n -\x n_1 + ax n G A. Then 

xf{x) = aox + a\x 2 + • • • + a n _ \x n + ax n+l G A. Therefore, a G I n +i and so 
In Q I n+ 1 - Thus, we obtain an ascending chain 

Io'Q h c h c ... 

of ideals of R. Since R is Noetherian, there exists an integer m such that 
I m = I n for all n > m. Again, every ideal of a Noetherian ring is finitely 
generated. Hence, each of the ideals 7o,/i,..., Im is finitely generated. Let 

Ik = ( a fcl) @'k2i ■ • • j ) 

for k = 0,1,..., m, where a^j is the leading coefficient of fkj (x) G A, a poly¬ 
nomial of degree k. Note that aoj = foj(x) for k = 0 and j — 1, 2,... ,tk are 
the polynomials of degree 0 in R[x]. Let 


S = {fkj{x) | 0 < k < m, 1 < j < t k } 

and B be the ideal generated by S. Then 

B = (foi(x),...,fot 0 (x),...,f Tn i(x),...,f rntm (x)) C A 

since each fkj(x) £ A. Next we show that A C B. We prove this by induction 
on the degree of the polynomials in A. Let f(x) be any polynomial of degree 
0 in A. Then f(x) € Io C B. Hence, any polynomial of degree 0 in A is also 
in B. Now assume that any polynomial of degree less than r in A is also in B. 
Consider a polynomial 


f ( 3 ?) — 60 T b\X -(- • ■ • T b T x G A 

~Pu*uc- “Phy-SALc-A. 
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with b T 7 ^ 0. If r < m , then b T £ I T and hence b r — c\a r \ + C 2 a r 2 + • ■ • + c T a T t r 
for some ci, c 2 ,..., c r £ R. Hence, the polynomial 

hr(x) = Cifri(x) + C 2 fr 2 (x) H-h C r f rtr (x) € B 

and the coefficient of x f of this polynomial is b r 0. Thus, h r {x) is of degree 
r with b T as the coefficient of x T . Therefore, f{x) — h r [x) is a polynomial of 
degree less than r and f(x) — h r (x) £ A. Thus, by the induction hypothesis, 
f{x) — h r {x) £ B. But h T {x) £ B. Consequently, f(x) £ B. Hence, by induction, 
if f{x) £ A with deg f(x) < m, then f(x) £ B. If m < r, then b r £ I r = I m = 
(a m i, a m2 , • • •, ^mtm) ■ Therefore, there exist d m2 ,..., £ R such that 

br — hjnYCLjni T d Tn 2Q'm2 T * ‘ ' T d rri f fTn CL Tn f Tn . 

This implies that the polynomial 

h 2 {x) = f(x) - x r ~ m (d Tnl f Tn i(x ) + d m 2 /m 2 (z) -4-h d m i m / mtrn (a:)) (18.1) 

is a polynomial in A of degree less than r. Hence, by the induction hypothesis, 

h 2 (x) £ B. But dmifmi(x) + d m 2 fm 2 (x) H-b d m t m fmt m {x ) € B. From Eq. 

(18.1), it follows that f{x) £ B. Therefore, by induction, if f(x) £ A with 
deg/(x) > m, then f(x ) £ B. Consequently, we find that A = B, proving that 
A is finitely generated. ■ 

Corollary 18.1.13 (Hilbert Basis Theorem) Let R be a comm,utative ring 
with 1. If R is a Noetheria.n ring, then the polynomial ring R[x i,... ,x n ] is a 
Noetheria,n ring. M 

If F is a field, then F is clearly Noetherian since it has only two ideals. 
Thus, the polynomial ring F[x i,..., x n ] is a Noetherian ring. 

Thus, we find that the Hilbert basis theorem gives us a wide class of Noethe¬ 
rian rings. 

We now introduce another class of rings called Artinian rings. First we note 
the following equivalence. 

Theorem 18.1.14 In any ring R , the following conditions are equivalent. 

(i) R satisfies the DCC for left ideals. 

(ii) R satisfies the mC for left ideals. 

Proof. The proof is similar to the proof of Theorem 18.1.4. We leave the 
proof as an exercise. ■ 

Example 18.1.15 Z does not satisfy the DCC since (2) D (4) D ( 8 ) D • • • is 
an infinite chain with (2 n ) D (2 n+1 ) , n = 1, 2,.... The polynomial ring F[x) 
does not satisfy the DCC since (x) D (x 2 ) D ( x 3 ) D ■ • • is an infinite chain 
with (; x n ) D (z n+1 ), n= 1, 2,.... 
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Example 18.1.16 Let R be the ring of Example 10.1.8. For r £ R, define 
7 r = {/ G R | f(x) = 0 for all — r < x < r}. 

Now I T is an ideal of R and I r C It if t < r. Therefore, 

I\ D I2 I3 2 ) •' • 

is an infinite strictly descending chain of ideals and 


I\ C /l C Ii c • • • 

2 3 

is a.n infinite strictly ascending chain of ideals of R. Hence, R satisfies neither 
the ACC nor the DCC. 

Definition 18.1.IT A ring which satisfies the DCC for left (right) ideals is 
called a left (right) Artinian ring. 

A ring which is both left Artinian and right Artinian is called an Artinian 
ring. 

By Theorem 18.1.14, it follows that a ring R is left Artinian if and only if 
the mC holds for left ideals in R. 


Example 18.1.18 Let p be a fixed prime and 


Z(p°°) = {-^eQ|0<a<p n , n € N 


Then (Z(p°°),+,-) is a commutative ring without identity, where + (addition) 
is m.odulo 1 and a ■ b = 0 for all a,b £ Z(p°°). From, the definition of multi¬ 
plication, it follows that every subgroup of (Z(p°°),+) is an ideal. Hence, the 
ideals of Z(p°°) are precisely the subgroups of (Z(p°°), +). 

Let I be a nontrivial ideal ofZ(p°°). Let k be the smallest positive integer 
such that ^ I for som,e integer q, 0 < q < p k . If p\q, then ^ I for som,e 

integer a, 0 < a < p k ~ l , contrary to the choice of k. Therefore, gcd(p, q) = 1. 
Now 

f i 2 p k ~ l — 1 } 

J ~ j pk -1 ’ pk -1 ’ ■ • ■ ’ -pk-1 r 


is a subset of I. Let us show that I — J. 

Consider the rational number where gcd (p,r) = 1 and n > k. Suppose 

that ^ 6 /. Since gcd(p, r) = 1 , there exist integers x and y such that rx+py = 
1. Now 


xr 



( xp n k )r 


P. 


71 


and 
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py _ y 

pk p k ~ l 

(both the numbers are reduced m,odulo 1) are in I. Hence, 

_ xr + yp e j 

pk pk 

This is contrary to the choice of k. Hence, I — J — { 0, ^n, ^r, . - - , 

We denote this ideal by 1^. It is also clear that for any positive integer R is 
an ideal ofZ>(p°°). These ideals form, the following strictly ascending chain 

{0} c h c h C • • • C I k c • • • 

in Z(p°°), proving that Z (p°°) is not Noetherian. Since every proper ideal is 
finite, every descending chain of ideals must be finite. Therefore, Z (p°°) is 
Artinian. 


It is known that any finite ring with more than one element and without 
zero divisors is a division ring. The following theorem generalizes this result. 

Theorem 18.1.19 Let R be a left Artinian ring with, more than one element. 
If R does not contain zero divisors, then R is a division ring. 


Proof. Let 0 ^ a G R. Now 

H 2 (a 2 ), 2 (a 3 ), 2 -.., 

where (a 71 ); is the left ideal generated by a n . Since R is left Artinian, there exists 
a positive integer n such that {a n ) l = ( a n+1 ) [ = ■ ■ •. Therefore, a n G (a n+1 ) z . 
Thus, there exist r € R and m € Z such that 

a n = ra n+1 T ma n+1 . 

Now a 7 ^ 0 and R has no zero divisors. Therefore, a n ^ 0. This implies that 
a ra 2 +ma 2 = (ra+ma)a. Let e — ra+ma. Then bea — ba implies that be = b 
for all b € R and e 2 = e. This also shows that e / 0. Now eb = e 2 b implies 
b = eb. Hence, e is the identity element of R. Now e — ra + ma = (r + me)a. 
This implies that left inverse exists for each nonzero element of R. Hence, a~ l 
exists in R for all nonzero element a G R. Consequently, R is a division ring. I 

Corollary 18.1.20 A commut,a,tive Artinian ring is a field, if and only if it is 
an integral dom,ain. ■ 

We now want to characterize J-semisimple rings which are either Noetherian 
or Artinian. 


Theorem 18.1.21 Let R be a commutative ring with 1. If R is an Artinian 
ring, then rad,R is a nilpotent ideal. 
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Proof. Let J = radiL Now J n is an ideal of R for all positive integers n 
and 

J D J 2 D J 3 D • • • . 

Since R is Artinian, there exists a positive integer n such that J n = J n+1 = • • •. 
Let / = J n . Then 7 2 = I. Suppose that I 7 ^ {0}. Let 

T = {T | T is an ideal of R, TCI and IT ^ {0}}. 

Now I £ T and so T 7 ^ <f>. The minimal property of R on ideals implies that 
T contains a minimal element, say, To. Then To C /, /To 7 ^ {0}, and To is 
minimal in T . Now /To 7^ {0} implies that la 7^ {0} for some nonzero a C Tq. 
Now la is an ideal of R. Also, I(la) = I 2 a — la 7 ^ {0} and la C To C /. 
Thus, la £ T . By the minimality of To, Tq — la. Therefore, there exists b £ I 
such that a — ba. Now b£l = J n CJ = rad R. Thus, 1 — xb is a unit for all 
x £ R and so (1 — 6)" 1 exists in R. As a result, we deduce that a = 0 since 
a(l — b) = 0. However, this is a contradiction. Hence, J n = {0} and so J is 
nilpotent. ■ 

Corollary 18.1.22 Let R be a, commutaiive ring with 1. If R is an Artinian 
ring, then every nil ideal of R is nilpotent. 

Proof. Let / be a nil ideal of R. Then I C rad R by Corollary 17.2.4. Since 
rad R is nilpotent, there exists a positive integer n such that (radi?) n = { 0 }. 
Then I n C (radi?) n = {0}. Hence, I n = {0} and so I is nilpotent. ■ 

Theorem 18.1.23 Let R be a commutative ring with 1 . If R is J-sem,isimple 
Artinian, then R is a direct sum of a finite number of fields. 


Proof. Let T be the collection of all maximal ideals of R. Then T 7 ^ 0. We 
now show that T has only a finite number of elements. Suppose that \T\ = 00. 
Then T contains an infinite set {Mi \ i £ N} of distinct maximal ideals of R. 
Also, 

Mi C M1M2 C MiM 2 M 3 C • • •. 

* 

Since R is Artinian, there exists a positive integer n such that 

MiM 2 ■■■M n = MiM 2 v • M n+ i 

for all i > 1. Therefore, M1M2 ■ ■ • M n C M n+ \. Since M n+ \ is also a prime 
ideal, M x C M n + 1 for some i, 1 < i < n. This contradicts the assumption that 
Mi, M2,..., M n , M n+ 1 are all distinct maximal ideals. Therefore, R has a 
finite number of maximal ideals. Since R is J-semisimple, 

n{M | M is a maximal ideal of R} = {0}. 
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We can find maximal ideals M\, M 2 ,..., M n such that • -HM n = {0}, 

but 

Ii = Mi n m 2 n • ■ • n M^ n M i+1 n • • • n M n ^ {0} 

for all i, 1 < i < n. Thus, Mi n A = {0} for all i, 1 < i < n. Since Mi is 
maximal and Ii 2 M, R + Mi = R for alH, 1 < i < n. Hence, R = R © M t for 
alH, 1 < i < n. This implies that R/Mi ~ R for alH, 1 < i < n. Since R/Mi 
is a field, R is a field for all i, 1 < i < n. Let x E R. Then x = di + for some 
a* E 4 and m.i E Mi, for all i, 1 < f < n. Let y — ai + a 2 + • • • + a n . Then 

x y (x di') Ui fl 2 * —1 d-^-j-i ■ 

■— 1X1,i ( 2 i 02 " ‘ di — 1 d{^. 1 ' d n E Mi 

since E Mi for A; 7 ^ i, 1 < < n. Therefore, x — y £ H^ =1 Mi = {0} and so 

z — y E £?=i 4- This implies that 4 = £" =1 R. Now D fl £"_i R C nM^ — 

i^k 

{0}. Hence, R = ®f =1 4- ■ 

Theorem 18.1.24 Let R be a commutative ring with 1 . If R is J~sem.isim,ple 
Artinian, then R is Noetherian. 


Proof. Let R be a J-semisimple commutative Artinian ring. Then R is 
isomorphic to a direct sum of a finite number of fields. Let R ~ i 7 !®^® • • -®F n , 
where Fi is a field, 1 < i < n. Now each F{ contains only two ideals and hence 
Fi is Noetherian. Thus, R is a finite direct sum of Noetherian rings. Hence, 
by Theorem 18.1.11, R is Noetherian. ■ 

Remark 18.1.25 In this book, we proved Theorem, 18.1.24 for J-sem,isim,ple 
com,m,uta,tive Artinian rings for the sake of simplicity. However, it is known, 
in general, that any Artinian ring with 1 is Noetherian. 


18.1.1 Worked-Out Exercises 

<0> Exercise 1 Show that a subring of an Artinian ring may not be Artinian. 

Solution: The field Q of rational numbers has only two ideals and hence 
is Artinian. The ring Z is a subring of Q, but Z is not Artinian. 

0 Exercise 2 Show that the sum of all nilpotent ideals of a commutative 
Noetherian ring R is a nilpotent ideal. 


Solution: If R is nilpotent, then the result is immediate. Suppose R is 
not nilpotent. Let A = Y^izi Ai be the sum of all nilpotent ideals of R. Then A 
is the ideal generated by U i^iAi. Let T — {Ai | i E /}. Since R is Noetherian, 
T has a maximal element, say, B. Let us show that Ai C B for all i E /. Let 
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A{ E T. Now A z + B is nilpotent and B C Ai + B. Since B is a maximal element 
of T and Ai + B E T, it follows that A* + B = B. Hence, Ai C B for all i E /. 
This implies that A — (U i^iAi) C B. But B — Ak for some k E I. Therefore, 
A = B. Thus, A is nilpotent. 

Exercise 3 Let R be a commutative Noetherian ring with 1. Show that every 
ideal of R contains a finite product of prime ideals. 

Solution: Let T ~ {A | A is an ideal of R and A does not contain any finite 
product of prime ideals of i?}. We show that T — <fr. Suppose T ^ 4>. Then T 
has a maximal element, say, Aq. Now Ao cannot be a prime ideal. Thus, there 
exist ideals B and C of R such that BC C Ao, but B % Ao and C <2 Ao- Now 
Ao C Ao + B and Ao C Ao + C. Hence, Ao + B and Ao + C are ideals of R 
such that Aq + B, Ao + C £ T. Then Ao + B and Ao 4- C contain a finite 
product of prime ideals. This implies that (Ao + B)(Aq T C ) contains a finite 
product of prime ideals. Now (Ao + B)(Ao + C) = AoAo + AoC -F AqB + BC C 
Ao- This implies that Ao contains a finite product of prime ideals, which is a 
contradiction. Thus, T — (f>. Hence, every ideal of R contains a finite product 
of prime ideals. 

Exercise 4 Let / be an epimorphism of a Noetherian ring R onto itself. Show 
that / is an isomorphism. 

Solution: For each positive integer n, f n is an epimorphism and 

Ker / C Ker / 2 C Ker / 3 C • • • 

is an ascending chain of ideals in R. Since R is Noetherian, there exists a 
positive integer m, such that Ker f m — Ker f m+z for all i > 1. Thus, Ker 
/ m = Ker f 771+1 . Let x E Ker /. Since f m is onto R , there exists an element 
y E R such that f m (y ) = x. Now 0 — f(x) = f m+1 (y ). This implies that y E 
Ker f 771+1 = Ker/ m . Hence, x = f m (y ) — 0. Thus, / is one-one and so is an 
isomorphism. 


<0 Exercise 5 Let R — 


a b 
0 c 


a E Z, 6, c E Q 


(i) Show that R is a subring of M 2 (Q). 

(ii) Show that R is not left Noetherian. 

(iii) Let A be a nonzero right ideal of R such that every element of A is 

of the form ^ . Show that A is finitely generated. 
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(iv) Let A be a nonzero right ideal of R such that every element of A is 
a b ' with a 7 ^ 0. Show that A is finitely generated. 


of the form 


0 c 


(v) Show that R is right Noetherian. 


Solution: (i) It is a routine verification to show that R is a subring of 

M 2 ( Q). 

(ii) For any positive integer n, let 


h = 


0 — 

u 2 n 


0 0 


m G Z 


Then each I n is a left ideal of R. Since p = — -^n+i- But 

I n . Therefore, I n C / n +i- Thus, 

/1C/2C/3C- 


0 


1 


2 n + 1 
0 0 


i 


is an infinite strictly ascending chain of left ideals of R. Hence, R is not left 
Noetherian. 

(iii) Case 1: Suppose c = 0 for all ^ 


0 c 


0 b 
0 0 


of A are of the form 
ists a nonzero rational number b such that 

G A. Hence, A — 

Case 2: Suppose b — 0 for all 
Case 1 , we can show that A - 


In this case, the elements 
, where b G Q. Since A 7 ^ {0}, there ex- 

G A. Thus, 


1 

0 

<3- 

_i 


1 - 

O 

O 

1 _ 

1 

0 

0 


.0 I. 


0 1 
0 0 

0 6 

0 c 

0 0 
0 1 


0 b 
0 0 


0 1 
0 0 


R. 


G A. In this case, proceeding as in 


R. 


Case 3: Suppose A contains an element 


0 b 
0 c 


Ifi4 = 


0 b 
0 c 


R : then A is generated by 


such that b 7 ^ 0 and c 7 ^ 0 . 

R C A. 


’ 0 b ' 

r 0 b ' 

0 c 

. Suppose Q c 


Now A contains an element 


0 u 
0 v 


such that bv 7 ^ cu. Then bv — cu 7 ^ 0 and 


0 bv — cu 

0 0 



’ 0 

b' 


- 

0 

u 


0 u 


- 1 

O 

O- 

_ 1 


0 

c 


0 

V 


1 0 

j 

<2 


0 c 


6 A. 
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Now 


0 1 
0 0 


0 bv — cu 
0 0 


l 

bv—cu 


E A and 


0 b 
0 0 


0 1 
0 0 


0 0 
0 b 


Thus, we find that 


0 0 


0 c 


0 b 
0 c 


0 b 


0 0 


E A. Since c ^ 0, this 


implies that jj ^ E A. Hence, A is generated by { 


0 1 
0 0 


0 0 
0 1 


i.e., A is finitely generated. 


°]} 
1 J 


(iv) Let no be the smallest positive integer such that 


no b 


0 c 


E A for 


some 6 , c E Q. We show that A is generated by either { 


0 0 
0 1 


0 It / 
x { 


n 0 0 


0 0 


0 1 
0 0 


n 0 0 
0 0 


0 1 


0 0 


Now 


0 n 0 


0 0 


no b 0 1 
0 c 0 0 


E A. This implies that 


0 1 
0 0 


0 n 0 
0 0 


0 0 


0 n~ 
no 


Thus, 


0 b 
0 0 


0 1 
0 0 


0 0 


0 b 


E A. Again 


n 0 0 
0 0 


no b 10 

0 c 0 0 


E A. 


Hence, 


0 0 
0 c 


no b 
0 c 


n 0 0 

0 0 


0 b 
0 0 


E A. 


If c 7 ^ 0, then it follows that q ^ E A and A is generated by { ^ jj 


0 1 
0 0 


, ^ }. If every element of A is of the form ^ ^ with a ^ 0, 


then A is generated by { 


no 0 


0 0 


0 1 


0 0 


}, where no is the smallest positive 


integer such that ^ q £ A for some b E Q. 


(v) From (iii) and (iv), it follows that every right ideal of R is finitely 
generated. Hence, R is right Noetherian. 
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18.1.2 Exercises 

1 . Give an example of a ring R with the following properties. 

(i) R is left Noetherian, but not right Noetherian. 

(ii) R is left Artinian, but not right Artinian. 

(iii) R is right Artinian, but not left Artinian. 

(iv) R is noncommutative and both Noetherian and Artinian. 

2. Show that a subring of a Noetherian ring may not be Noetherian. 

3. Give an example of a ring R in which every proper ideal is finitely gen¬ 
erated, but R is not Noetherian. 

4. In a right Artinian ring with 1, if ab = 1 for a,b E R, prove that ba = 1. 

5. Prove that every homomorphic image of a left Artinian ring is left Ar¬ 
tinian. 

6 . Let R be a commutative Artinian ring with 1. Show that in R, every 
nonzero prime ideal is a maximal ideal and show that R has only a finite 
number of prime ideals. 

7. Show that a Noetherian domain in which the sum of two principal ideals 
is a principal ideal is a PID. 

8 . Prove that every homomorphic image of an Artinian ring is Artinian. 

9. Let I be an ideal of a ring R. If I and R/I are both Artinian rings, prove 
that R is Artinian. 

10. Let R be a right Artinian ring and I be a nonnilpotent right ideal of R. 
Prove the following. 

(i) The collection T\ of all nonnilpotent right ideals of R which are con¬ 
tained in J, contains a minimal element Iq such that Iq — 1$. 

(ii) Let T — {J | J is a right ideal of R , JIq ^ 0, J C / 0 }- Then T 
contains a minimal element I\ and I\ contains an element u ^ 0 such 
that uIq = I\. 

(iii) I is not a nil right ideal. 

(iv) I contains a nonzero idempotent element. 

11. Prove that a commutative ring R with 1 is Noetherian if and only if every 
prime ideal is finitely generated. 
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12. Let R be a ring with 1. Let / : R —> [0,1] be such that 

f(a-b) > min{/(a) ,/(>)}, 

/(ra) > /(a) 

for all a, 6, r € iL 

(i) Prove that 1? is left Artinian if and only if for every mapping f : R —> 
[0,1] that satisfies the above conditions, |X(/)j < oo. 

(ii) Prove that R is left Noetherian if and only if for every mapping 
/ : R —> [0,1] that satisfies the above conditions, \T(f)\ is a well-ordered 
subset of [0,1]. 

13. Write the proof if the statement is true; otherwise give a counterexample. 

(i) Every finite ring is both Noetherian and Artinian. 

(ii) Every Noetherian domain is a field. 

(iii) Let R be a commutative ring with 1. If R is J-semisimple and Ar¬ 
tinian, then R is regular. 
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Amalie Emmy Noether (1882- 
1935) was born on March 23, 1882, in Er¬ 
langen, Germany, the oldest child. Her fa¬ 
ther, Max Noether, a noted mathematician 
himself, was a professor at the University 
of Erlangen. She studied mathematics and 
foreign languages at Erlangen from 1900 to 
1902. 

In 1903, Noether started her mathemat¬ 
ics career at the University of Gottingen. 

Since at that time girls could not be admit¬ 
ted as regular students, she was a nonma- 
triculated auditor. In 1904, she was permit¬ 
ted to enroll at the University of Erlangen 
and in 1907 she received her Ph.D, summa. cum laude. Her thesis was on algebraic 
invariants. 

In 1915, on Hilbert’s invitation, she went to Gottingen. There she lectured on 
courses given under Hilbert’s name. She applied her invariant theoretic knowledge on 
problems considered by Hilbert and Klein. Hilbert made several personal attempts 
to get her a regular position, but prejudice against women at that time thwarted his 
efforts. Finally, in 1922, she was appointed as an unofficial associate professor; later, 
she received a modest salary. She taught at Gottingen from 1922 to 1933. Due to 
the Nazi regime uprising, all Jewish professors were dismissed in April 1933. Through 
the efforts of Herman Weyl, she was able to get a visiting professor’s position at Bryn 
Mawr College and left for the United States in October 1933. She lectured and did 
research at Bryn Mawr College and at the Institute of Advanced Study. Noether died 
of surgical complications on April 14, 1935. 

Influenced by Hilbert’s axiomatization of Euclidean geometry, Noether became 
interested in an abstract axiomatic approach to ring theory. Between 1922 and 1926, 
she published a series of papers focusing on “the general theory of ideals.” In her 
paper “Abstract construction of ideal theory in the domain of algebraic number fields,” 
published in 1926, she characterized rings in which every ideal is uniquely expressed 
as a product of prime ideals. This is analogous to Euclid’s fundamental theorem 
of arithmetic. Two of the generalized structures she associated with ideals are the 
“group” and the “ring.” She introduced the present-day definition of a ring in her 
paper, “Theory of ideals in a ring,” published in 1921. She showed that the ascending 
chain condition is important to ideal theory. She introduced the concept of a primary 
ideal and proved that in a commutative ring satisfying the ascending chain condition, 
every ideal can be expressed as an intersection of primary ideals. 

In 1932, while working on noncommutative rings in linear algebra with Richard 
Brauer and Helmut Hasse, she proved that every simple algebra over an ordinary 
algebraic number field is cyclic. From 1932 to 1934, she worked on noncommutative 
algebras by means of cross products. 

Noether published 45 research papers. 






Chapter 19 

Modules and Vector Spaces 


19.1 Modules and Vector Spaces 

Our main interest here is to set down only the results of vector spaces which 
are needed for our study of fields in the next chapter. We do this in such a 
way that the reader will become acquainted with the notion of a module. 

Definition 19.1.1 Let R be a ring. A commutative group (M,+) is called a 
left R-module or a left module over R with respect to a m.apping ■ : Rx M —> 
M if for all r,s £ R a,nd m, m! € M, 

(i) r ■ (m 4- m') — r ■ m + r ■ mf, 

(ii) r ■ (s • m) = (rs) • m, 

(Hi) (r + s) ■ m = r ■ m + s ■ m. 

If R has an identity 1 and if 1 • m. = m for all m 6 M, then M is called a 
unitary or unital left R-module. 

A right i?-module can be defined in a similar fashion. 

In the above definition, we used the same notation for the addition in the 
ring R and the addition in the group M. We also used the same notation for 
the multiplication in R and the multiplication between the elements of R and 
M. It should be clear to the reader by now that there are actually four distinct 
operations involved. We write rm for r ■ m. 

Example 19.1.2 In a ring R, every left ideal is a left R-module and every 
right ideal is a right R-m,odule. In particular, R is a left, and right R-module. 

Example 19.1.3 Every commutative group M is a m,odule over the ring of 
integers Z. For n £ Z and a £ M, the elem.ent na is defined to be a added to 
itself n times if n is positive and —a added to itself |n| tim.es if n is negative. 
Oa is defined to be the zero element of M. Under these definitions, M becom.es 
a unitary left 7i-m,odule. 
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Let M be any commutative group and R be any ring. If we define rm = 0 
for all r E R, m E M, then M forms a left 77-module, called a trivial module. 

Since all results that are true for left 77-modules are also true for right 77- 
modules, we prove results only for left 77-modules. From now on, unless stated 
otherwise, by an 77-module, we mean a left 77-module. 

Definition 19.1.4 Let M be an R-module and N be a nonempty subset of M. 
Then N is called a submodule of M if N is a subgroup of M and for allr E R , 
a E N, we have ra E N. 

It is clear that a submodule of an 77-module is itself an 77-module. 

Using arguments similar to those used for subgroups and ideals, one can 
show that the intersection of any nonempty collection of submodules of an 
77-module is again a submodule. 

Definition 19.1.5 Let X be a, subset of an R-module M. Then the subm,odule 
of M generated by X is defined to be the intersection of all subm.odules of 
M which contain X and is denoted by (X) . X is called a basis of (X) if no 
proper subset of X generates (X) . If M — (X) and X is a, finite set, then M 
is said to be finitely generated. When X = { x } and M = {{cc}) , then M is 
called a cyclic R-m,odule and in this case we write M = ( x) . 

We ask the reader to prove that any finitely generated module has a finite 
basis. 

The proof of the following theorem is similar to that of the corresponding 
theorem for ideals, Theorem 11.2.7. Hence, we omit its proof. 

Theorem 19.1.6 Let M be an R-module and X be a, nonempty subset of M. 
Then 

(X) = {Ej=i fiXi + T!j=\ n j x 'j | n E R,nj E Z, x i7 xb e X, 

1 < i < k,l < j < l,k,l € N}. 

If M is a unitary R-m,odule, then 
k 

(X) = riXi | ri £ R, Xi G X, 1 < i < k, k E N}. ■ 

i= 1 

Example 19.1.7 (i) Q is a Q-module. If N is a subm,odule of Q, then N is 
a left ideal of Q. Since Q is a field, the only left ideals of Q are {0} and Q. 
Hence the subm,odules of Q are {0} and Q. 

(ii) We know that Q ® Q is a commutative group. For all x £ Q and for 
all (a,b) E Q © Q, define x(a,b) = ( xa,xb ). Then Q 0 Q is a Q -m.odule. We 
now determine all submodules of Q © Q. Let M be a nonzero Q-subm.odule of 

Q® Q. 
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Case 1: Suppose for all (a, 6 ) E M, 6 — 0. Now there exists (a, 0) E M 
such that a ^ 0. Then (1,0) = ^(a,0) E M. Thus, M = Q © {0}. 

Case 2: Suppose for all (a, 6 ) E M , a = 0. Now there exists (0, 6 ) E M 

such that 6 0. Then (0,1) = £(0, 6 ) E M. Thus, M — {0} ® Q. 

Case 3: Suppose there exists (a, 6 ) E M such that a 0, 6 0. 

Case 3a: Suppose M = ((a, 6 )) . Then M is a cyclic submodule of Q © Q 
generated by (a, 6 ). 

Case 3b: Suppose M ((a, 6 )). Then ((a, 6 )) C M. Thus, there exists 

(a', 6 ') E M\((a, 6 )). Then a' ^ 0 or 6 ' ^ 0. Suppose that a' — 0. Then 

(0,1) = ^(0,6') E M. Therefore, (a,0) = (a, 6 ) — (0,1)6 E M. Hence, (1,0) = 
l(a,0) E M. Thus, (1,0), (0,1) E M. This implies that M — Q©Q. Similarly, 
if b' = 0, then M = Q © Q. 

Now suppose that a! ^ 0 and b' 0. If = y — t (say), then t(a',b') = 
(ta',tb f ) = (-^a',yb') = (a, 6 ) E ((a, 6 )) , which is a contradiction. Therefore, 
gj and so ab' — ba! 7 ^ 0. Let (p, q) E Q © Q. Choose t = and 

s = Then (p,q) = t{a,b) + s(a', 6 ') E M. Thus, Q © Q C M. Hence, 

M = Q © Q- 

Consequently, if M is a Q-submodule of Q © Q, then M is of the following 
form: 

(i) M — {0}, or 

(ii) M = {0} © Q = (( 0 , 1 )) , or 

(iii) M = Q©{0} = ((l l 0)),or 

(iv) M = ((a, 6 )) , a 7 ^ 0, 6 ^ 0, a, 6 E Q, or 
(y) M = Q © Q. 

This also proves that M is finitely generated. 

Definition 19.1.8 Let F be a field. A unitary (left,) F-module M is called a 
(left) vector space over F. The elements of M are called vectors and the 
elements of F are called scalars. A subm,odule of M is called a subspace of 
M. If X is a subset of M such that M = (X) , then X is said to span or 
generate M and M is called the span of X over F. 

Example 19.1.9 Let F be any field and F n denote the Cartesian product of F 
with itself ntim.es. Then F n becom.es a vector space over F under the following 
definitions: For all (cq, < 22 ,..., a n ), ( 6 j, 62 ,..., 6 n ) E F n and a E F 

( a i, a 2 , • • •, a n) + ( 61 , 62 ,...., b n ) — (ai + 61 , ©a .2 + 62 ,..., a n + 6 n ), 

n(o >2 , (22 ,..., a^i ) (rmi, (m 2 ,«.., aorfi. 


X = {(1,0,0,..., 0), (0,1,0,..., 0),..., (0,0,0,..., 1)} 
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spans F n since for all (ai, 02 ,..., a n ) G F n , 

( a i, 02,..., a n ) = oi(l, 0 , 0 ,..., 0 ) + 02(0, 1 , 0 ,..., 0 ) + • • • + a n ( 0 , 0 , 0 ,..., 1 ). 

When n = 2 or 3 and F is the field of real numbers, then the vector space F n 
over F is the one usually encountered in elementary analytical geometry. 

By Example 19.1.9, R 3 is a vector space over R. 

Example 19.1.10 Consider the vector space R 3 over R. Let 

U — {(a, b , c) G R 3 | 2 a + 3 b + 5c — 0}. 

Then U is a subspace of V 3 (R). Let 

U\ — {(o, 6 , c) € R 3 | 2 a + 36 + 5c = 5}. 

Now (0,0,1) and ( 1 , 1 , 0 ) € U\, but (0,0, 1 ) + ( 1 ,1, 0) ^ U\. Hence, U\ is not a 
subspace of R 3 . 

Example 19.1.11 Let V be a vector space over F. Then {0} and, V are sub¬ 
spaces of V. These are called, trivial subspaces of V. 

Theorem 19.1.12 Let V be a, vector space over F and S be a nonempty subset 
of V. Then S is a subspace of V if and only if for all a G F and for all x,y G S, 
ax + y G S. 

Proof. Suppose S is a subspace of V. Then for all a G. F and for all x, y G S , 
ax G S and so ax 4- y G S. Conversely, suppose for all a G F and for all 
x, y G S, ax + y G S. Since S ^ <fi, there exists x G S. By Exercise 2 (page 431), 
—x = ( — l)x. Therefore, 0 = — x + x = ( — l)x 4- x G S. Hence, for all x G S, 
—x = ( —l)x A 0 G S. Also, for all x, y G S, x + y = lx + y G S. S inherits the 
associative and commutative laws. Thus, (S, +) is a commutative group. Now 
for all a G F and for all x G S, ax = ax 4- 0 G S. Therefore, S' is a vector space 
over F since the other properties are inherited. ■ 

Theorem 19.1.13 Let V be a vector space over F and {U a | a £ 1} be any 
nonempty collection of subspaces of V. Then D a £lU a is a subspace of V. 

Proof. First note that 0 G U a for all a G I and so 0 G n aG /C/ a - Therefore, 
C a QiU a 7 ^ (f). Let a G F and x, y G n QG /£/ Q . Then x,y G U a for all a. Since U a 
is a subspace of V, ax + y G U a for all a £ I and so ax + y G H a£iU a . Thus, 
r\aeiU a is a subspace of V by Theorem 19.1.12. ■ 

Theorem 19.1.14 Let V be a vector space over F and S be a nonempty subset 
ofV. Then 

{S) = | a, G F, Si G 5}, 

where 'ffaiSi is a finite sum,. 
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Proof. Let U = {]C a i s % \ a i G F, Si £ S}. Let a £ F and Y a 2 Si, Y bjSj £ U. 
Then a(X^s-i) + Y^jSj — Y( aa i) s t + Y^bjSj £ U and so U is a subspace of 
V by Theorem 19.1.12. Since for all s £ S, s = Is £ U, U D S. Thus, U D { S) 
since ( S ) is the smallest subspace of V containing S. Let Y a i s i £ U. Then 
since SiE.SC (S) , a{Si £ ( S) . Thus, Y a i s i £ {S) > whence U C (S) . ■ 

Definition 19.1.15 Let V be a vector space over the field F. A subset X of V 
is called linearly independent over F if for every finite number of distinct 
elements x\, X 2 ,..., x n £ X, a\x\ + 02 x 2 + • ■ • + a n x n = 0 implies that ai = 
02 = • • • a n = 0 for any finite set of scalars {ai, 02,..., a n }. Otherwise X is 
called linearly dependent over F. 

The set X in Example 19.1.9 is linearly independent over F. {0} is linearly 
dependent over F. 

Definition 19.1.16 Let V be a vector space over F. A subset AofV is called 
a basis for V over F if A spans V, i.e., V — ( A) , and A is linearly independent 
over F. 


Consider the zero vector space, {0}, over the field F. We note that the 
empty subset, 0, is linearly independent over F vacuously and that <p spans 
{0}. Hence, 0 is a basis for {0}. 


Example 19.1.17 The set 

X = {(1.0,0.0), (0,1,0,... ,0).(0,0,0,...,1)} 

of Example 19.1.9 is a basis for F n . We showed there that X spans F n over F. 
Suppose 

(0, 0,..., 0) = a\ (1,0,0,..., 0) + 02 ( 0 ,1,0,..., 0) + • • • + a n (0,0,0,..., 1). 

Then (0,0,..., 0) = (ai, 02, ..., a n ). Therefore, we must have = 0 for i = 
1, 2,..., n. Thus, X is linearly independent. 


Theorem 19.1.18 Let V be a vector space over F a.nd S be a subset of V. If 
s £ (S ), then (S U {s}) = (S'). 


Proof. Clearly (S) C (S U {s}). If S = <j>, then (S) = {0} and so s = 0. 
Hence, (S U {s}) = ({0}) = {0} = (S) . Suppose S ^ (j>. Let Y a i s i + as £ 
(S U {s}) , where s z £ S. Then Y a i s i, as € (S) and so Y a i s i + as £ (S) . 
Hence, {S U {s}) = (S ). ■ 


Theorem 19.1.19 Let V be a vector space over F and A — {xi, X 2 , ..., x r } 
be a subset ofV which spams V. Let B be any linearly independent set of vectors 
in V. Then B contains at m.ost r vectors. 
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Proof. If B contains less than r vectors, the theorem is true. Suppose B 
contains at least r vectors, say, yi, 2 / 2 ,..., y T £ B. Then since A spans V, 

T 

yi — 'y ] anXi 

i= 1 

and since y\ 7 ^ 0, not all an =0, say, an ^ 0. Thus, 

r 

x\ = Y^(- a nan)xi + a^yi. 

i =2 

This implies that x\ £ {{2/1, X2,..., x r }). Hence, {{yi, X2, • ■ ■, x r }) = V by 
Theorem 19.1.18. Assume ({2/1, yi, ■ ■ . ,yk, x k+i, ■ ■ -> x T }) = V, the induction 
hypothesis. Then 


yk+i € ({yiiy2> ■ ■ • yki 3 -k+iy ■ • ■ %r}) 


Thus, 


Vk +1 — ^ ] a i,k+iyi T ^ 

i—1 i=k +1 

and not all a^/- + i — 0 for i = k + 1,... ,r, say, ak+ 1^+1 7 ^ 0- This implies that 

k r 

x k+l ~ '^,(— a k+l t k+l a "i,k+'l)yi + (~ a k+l,k+l a i,k+l) x i A a k+l,k+iyk+l- 
i— 1 i=k+2 

Thus, x k +i £ ({yi,y 2 , ■ • • ,y k ,yk+i, x k+ 2 , ■ ■ - ,x r }). Hence, 


V {{2/1 > 2/2? ■ • ■) yki yk+i ? %k+2, • • • j }) 

by Theorem 19.1.18. Thus, {{yi, 2 / 2 , • ■ ■, y r }) — V by induction. If there ex¬ 
ists y £ B such that y yi , i = 1,2, . ,.,r, then y = Yli=\ a iyi and so 
0 = Y7i=\ a iVi + (-1 )y and since -1 ^ 0, 2 / 1 ,2/2, • y r , y are not linearly 
independent, a contradiction. Therefore, y does not exist and so B = { 2 / 1 5 2 / 2 » 
..., y r }. ■ 


Theorem 19.1.20 Let V be a vector space over F, A = {xi,..., x T ], and. 
B = { 2 / 1 ,. - -, 2/s} be two bases for V. Then r = s. 


Proof. Since A spans V and B is linearly independent, s < r by Theorem 
19.1.19. Similarly, r < s. ■ 


Definition 19.1.21 Let V be a vector space over F. IfV is spanned by a finite 
set of vectors, then V is called finite dimensional over F. 


Lemma 19.1.22 Let V be a vector space over F and A be a linearly indepen¬ 
dent subset ofV. If x £V and x £ (A) , then A U {x} is linearly independent. 
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Proof. Let x\,... , x n E A Suppose 0 — a\X\ + < 22 X 2 + • • • + a r x T + ax. 
Suppose a 0. Then 

x = (—a) _1 uiXi + • ■ • + (— a)~~ l a T x r E (A ), 

a contradiction. Thus, a — 0. Hence, 0 = 01 X 1 + 0 . 2 X 2 + • • -+a T x r . Since {xi,X 2 , 
..., x r } is linearly independent, 01 = 0, ..., a T = 0. Thus, A U {x} is linearly 
independent. ■ 

Theorem 19.1.23 LetV be a finite dimensional vector space over F. ThenV 
has a basis. 


Proof. If V = {0}, then 0 is a basis for V. We now assume that V 7 ^ {0}. 
Let xj E V be such that xi 7 ^ 0. Then xi is linearly independent. If (xi) 7 ^ V, 
then there exists X 2 E V such that X 2 £ (xi) . By Lemma 19.1.22, xi and 
X 2 are linearly independent. Suppose xi,...,xjt E V are linearly indepen¬ 
dent and {{xi,...,Xfc}) 7 ^ V. Then there exists Xfc+i E V such that Xk+i ^ 
{{xi,...,X*.}). Therefore, xi,... ,x*., x^+i are linearly independent. Since V 
is finite dimensional, V is spanned by, say, r vectors. By Theorem 19.1.19, 
any linearly independent set of vectors in V cannot have more than r vec¬ 
tors. Hence, if we continue the above process of constructing x^’s, then there 
must exist a positive integer s such that {xi,... ,x s } is linearly independent, 
{{xi,..., x s }) = V ., and s < r. Thus, {xi,... ,x 5 } is a basis of V. ■ 

Theorem 19.1.23 gives us a method for constructing a basis for a finite 
dimensional vector space V of dimension n over F. We first take any nonzero 
vector x\ of V. If {x\) = V, then {xi} is a basis of V. If (xi) C V, then we take 
any X 2 E V, X 2 ^ (xi). Then by Lemma 19.1.22 {xi, X 2 } is linearly independent 
over F. If {{xi,X 2 }) = V, then {xi,X 2 } is a basis for V over F. If ({xi,X 2 }) C V, 
we can choose X 3 E V, X 3 ^ ({xi,X 2 }) and so on. In a finite number of steps, 
precisely n steps, we must arrive at a basis for V over F. 


Definition 19.1.24 Let V be a finite dimensional vector space over F. The 
dimejision V is the number of elements in a basis for V. 

From the statements following Definition 19.1.16, it follows that the zero 
vector space, {0}, is of dimension 0. 


Theorem 19.1.25 Let V be a vector space of dimension n over the field F. 
Then X = {xi, X 2 ,.. -, x n } is a basis of V if and only if every vector in V is a 
unique linear combination of X \, X 2 ,..., x n over F. 
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Proof. Suppose X is a basis of V over F. Then by Theorem 19.1.14, every 
vector v G V is a linear combination of x 1} X 2 ,...,x n . Let 

V - a\X\ ”1“ * ■ * ”1“ — f?]_X]_ —f“ * ■ * ”t~ b'fi x vi 

be any two linear combinations of xi,X 2 , ..., x n . Then 

0 = (ai - 6 i)xi -i-f- (a n - b n )x n . 

The linear independence of X over F implies that ai — b\ = 0, ..., a n — b n — 0. 
That is, the representation of v as a linear combination of xi, X 2 ,..., x n is uni¬ 
que. Conversely, suppose every vector in V is a unique linear combination of 
xi, X 2 ,..., x n over F. Then clearly X generates V over F. Suppose 0 = aixi + - • ■ 

+a n x n for ^ € F. Since also 0 = Oxi 4-+0x n , we have ai = 0, i = 1, ..., n. 

Thus, X is linearly independent over F. By definition, X is a basis of V over 

F. ■ 

We now show that every nonzero vector space, not necessarily finite dimen¬ 
sional, has a basis. For this we prove the following lemma. 

Lemma 19.1.26 Let V be a vector space over a field F a.nd X be a nonempty 
subset of V. Then X is a basis for V if and only if X is a maximal linearly 
independent set over F. 

Proof. If X is a basis for V, then X is linearly independent over F and 
(X) = V. Let y E V, y ^ X. Then V = (X) C (X U {y}} C V so that 
V = (X U {y}) . Since the proper subset X of X U {y} also generates V, X 
U{y} cannot be linearly independent over F. Thus, X is a maximal linearly 
independent set over F. Conversely, let X be a maximal linearly independent 
set over F. It suffices to show that V — (X ). If V D (X ), then there exists 
y £ V, y (X). By Lemma 19.1.22, X U{y} is linearly independent over F, 
which contradicts the maximality of X. Thus, V = (X) . ■ 

Theorem 19.1.27 Let V be a vector space over the field F. Then V has a 
basis. 


Proof. If V = {0}, then 0 is a basis for V. We now assume that V {0}. 
Let x be a nonzero element of V. Then {x} is a linearly independent subset 
of V. Let T be the set of all linearly independent subsets of V that contain 
{x}. Clearly T ^ (ft. T is a poset with respect to the set inclusion relation. By 
Zorn’s lemma, we can show that T has a maximal element, say, X. Then X is 
a maximal linearly independent subset of V and by Lemma 19.1.26, it follows 
that X is a basis of V. ■ 

Finally, we state the following theorem without proof. The finite dimen¬ 
sional case was proved in Theorem 19.1.20. 
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Theorem 19.1.28 Let V be a vector space over a field F. If A and B are two 
bases ofV, then \A\ = \B \. ■ 

From Theorem 19.1.27, we find that a vector space V over a field F has a 
basis B. If B is a basis for V over F , then |R| is called the dimension of V 
over F. 

19.1.1 Worked-Out Exercises 

0 Exercise 1 Let V be a vector space of dimension n. Show that any set of 
n linearly independent vectors is a basis of V. 

Solution: Let B be a set of n linearly independent vectors. Suppose 
V ^ (B) . Let y € V be such that y ^ { B) . Then B U {y} is a set of n + 1 
linearly independent vectors by Lemma 19.1.22, a contradiction to Theorem 
19.1.19. Hence, B is a basis of V. 

<0 Exercise 2 Let u\ = (0,1,1,0), U2 = (1, 0,1, 0), and 113 = (— 1 , —2,0,0) be 
three vectors in R 4 . Show that {ui, U 2 , W 3 } is a linearly independent set. 
Extend this set to a basis of R 4 . 

Solution: Let a\, 02 , <23 € R be such that 

a\u\ + a2U2 + 03^3 = 0 . 

Then Q -2 — a 3 — 0, a\ — 203 = 0, and ai + <22 — 0. From this, it follows that 
a\ — d 2 = <23 = 0. Hence, {-ui, 142 ,^ 3 } is a linearly independent set. Suppose 

(0,0,0,1) € ({iti,« 2j « 3 }) • 

Then there exists 61 , 62,63 G R such that 


biui + b 2 u 2 + & 3 U 3 = ( 0 , 0 , 0,1). 

Thus, 62 — 63 = 0, bi — 263 = 0, bi + 62 = 0, and 1 = 0, a contradiction. 
Therefore, e 4 = (0,0,0,1) ^ ({^ 1 , 162 ,^ 3 }) • Hence, {u\, U 2 , U 3 , e^} is a linearly 
independent set of vectors in R 4 . Since the dimension of R 4 is 4, {^i, U 2 , U 3 , e^} 
is a basis. 

<0 Exercise 3 Let V be a nonzero vector space of dimension n. Let X be a 
finite subset of V such that V = (X) . Show that X contains a subset Y 
such that Y is a basis of V. 
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Solution: Let X — {xi, X 2 ,..., X*}. Clearly t > n. Since V ^ {0}, X 
contains a nonzero element. Thus, X contains a linearly independent subset. 
If X is linearly independent, then X is a basis of V and n — t. Suppose 
X is not linearly independent. Then there exists X{, say, xt, such that x t G 
({#i, X 2 ,. -., £t_i}) . Then V = {{ xi,x 2 , - • -, xt-i )}• Let s = t - n — 1 . By 
repeating the process finitely many times, we can show that there are s vectors 
Xi 1} ..., x is G {xi, x 2 , , x t -i} such that 

Xii, • • • 5 Xi s G {{xj, X 2 , ■ • • j i , . . . , Xi s }) . 

Let 

Y — , x 2 f ■ ■., Xf^i ,..., Xi s j-. 

Then Y C X, |T| = n, and V = { Y) . If Y is not linearly independent, then 
there exists y G Y such that y G {X\{ 2 /}) • Then V — (T\{y}) and |T\{^}| = 
n — 1, a contradiction to the fact that the dimension of V is n. 

Exercise 4 Let T ~ {(x,y,z) G R 3 | 2x + 3y + z = 0}. Show that T is a 
subspace of V 3 (R). Find a basis for T. 

Solution: Since (0,0,0) G T, T ^ <fi. Let (xi, r/i, zi), (x 2 , y2 , z 2 ) G T and 
r G R. Then 2x\ + 3y\ + z\ = 0 and 2x2 + 3^2 + 22 = 0. Hence, 2{x\ + x 2 )+ 
3(2/i + 2 / 2 )+ (zi + z 2 ) =0 and 2rx\ + 3ryx + rz\ — r(2x\ + 3y\ + z\) — 0. 
Therefore, (x 1} 2 / 1 , ^ 1 )+ (X 2 , 2/2, 22 ) £ T and r(xi, 2 / 1 , z\) G T. Thus, T is a 
subspace of V^R). Now 2xi 4- 3y\ + 21 = 0 implies that (xi, 2 / 1 , ^ 1 ) = (xi, 
2 /i, -2xi -32 /i) = xi(l,0, —2)+ 2/i(0,1, -3). Since (1,0, -2), (0,1, -3) G T and 
(xi, 2 / 1 , z\) is an arbitrary element of T, T ~ ({(1,0, —2), (0,1, —3)}) . It is 
easy to verify that {(1,0, —2), (0,1, —3)} is a linearly independent set. Hence, 
{(1,0, -2), (0,1, —3)} is a basis of T. 

19.1.2 Exercises 

1. For the vector space R 3 over R, determine whether or not the sets listed 
are bases of R 3 . 

(i) {(1,1,0), (1,1,1), (1,0,0)}. 

(ii) {(2,0,0), (0,2,0), (0,0,2)}. 

(hi) {(-1,0,0), (0,-1,0), (0,0, -1)}. 

(iv) {(1,0,0), (1,1,0), (1,1,1), (0,1,0)}. 

2. Let M be an itbmodule, m G M and r G R. Prove that rO = 0, 0m = 0, 
and -(rm) = (—r)m = r(— m). 

3. Show that the intersection of two submodules of an iLmodule M is a 
submodule. 
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4. Show that the Z-module Q has no finite set of generators. 

5. Find all subspaces of the real vector space R 2 . Is is true that for any 
elements u = (a, 6 ) and v — (c, d) of R 2 , there exists a nontrivial subspace 
W of R 2 such that u, v E W? 

6 . Let A, R, and C be submodules of an R-module M. 

(i) Prove that A + B = {a+ b\ a£ A, 6 e 5} is a submodule of M. 

(ii) If ACC, prove that A + (B fl C) = (A + B) n C. 

7. Let M be an R-module and a E M. Show that T — {ra -t- na \ r E R, 

n E Z} is a submodule of M. 

8 . Let M be a unitary .R-module. M is called a simple R-module if M/ 
{0} and the only submodules of M are M and {0}. Prove that M is 
simple if and only if M is generated by any nonzero element of M. 

9. Let M be a unitary R-module. M is called Noetherian if for any se¬ 
quence 

A y C A 2 C • • • C A n C • - • 

of submodules of M, there exists a positive integer n such that A n = 
A n+ 1 = .... Prove that M is Noetherian if and only if every submodule 
of M is finitely generated. 

10. Let M be a unitary R-module. M is said to satisfy the maximal condition 
on submodules if any nonempty collection of submodules of M has a 
maximal element. Prove that M is Noetherian if and only if M satisfies 
the maximal condition for submodules. 

11. Let M be a unitary R-module. M is called Artinian if for any sequence 

Ai D A 2 3 ■ • • D A n D • - • f 

of submodules of M, there exists a positive integer n such that A n = 
A n+ 1 = • • •. Prove that M is Artinian if and only if any nonempty set of 
submodules of M has a minimal element. 

12. Let A be a submodule of a unitary R-module M and a E M. Let 

ci + N — {a -|- 6 | b E IV}. 


Prove the following. 

(i) fl E fl + N. 

(ii) For all a, b E M, a + N = b + N if ana only if a — b E N. 

(iii) For all a, 6 E M, either (a + N) fl (b + N) = 0 or a + N = b + N. 
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13. Let N be a submodule of an R- module M. Let 

M/N = {a + N \ a <E N}. 

Define the following operations on M/N 

(.a + N) + (b + N ) = (a + b) + N 
r{a + N) — ra + N 

for all a + TV, b + TV £ M/N , r £ R. Prove that M/N is an i?-module. 

14. Let IV be a submodule of an /^-module M. Prove that M is Artinian 
(Noetherian) if and only if N and M/N are Artinian (Noetherian). 

15. Let V be a finite dimensional vector space over F. If U and W are two 
subspaces of V, prove the following: 

(i) U + W = {u + w | u £ U, w £ W} is a subspace of V. 

(ii) dimU+ dim W = dim(C7 + W)~ dim(Z7 fl W). 

16. Let IV be a submodule of an A!-module M. N is called a direct summand 
of M if there exists a submodule P of M such that M — N 4- P and 
N fl P = { 0 }.'In a finite dimensional vector space V over F, show that 
every subspace is a direct summand of V. 

17. Write the proof if the statement is true; otherwise give a counterexample. 

(i) If {u,v,w} is a linearly independent subset of a vector space V, then 
{■u, u + v, w + v + u;} is also a linearly independent subset. 

(ii) If W is a subspace of a finite dimensional vector space V such that 
dim W — dim V, then W = V. 

(iii) Let V be a vector space over a field F. If 0 ^ v € V, then there exists 
a basis containing v. 

(iv) If S and T are two basis of a vector space V, then S U T is a basis of 

V. 
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Chapter 20 

Rings of Matrices 


In this chapter, we study some elementary properties of rings of matrices. Rings 
of matrices provide a rich source of examples for noncommutative ring theory. 
They are also useful for the understanding of noncommutative ring theory since 
they often appear in representation theorems. 


20.1 Full Matrix Rings 


Let R be a ring with 1. Let M n (R ) be the ring of all nxn matrices with entries 
from R. Let Eij be the element of M n (R ) whose (f, j) entry is 1 and all other 
entries are 0, 1 < i,j < n. Let Eij,Eki G M n (R). Then the following can be 
easily verified. 


Eij Eki — 


Eii if j = k 
0 if j ^ k. 


Let (ctij) G M n (R). Then (a tj ) = a ij E ij- 

The following describes ideals of M n (R). 


Theorem 20.1.1 Let M be cm ideal in M n (R). Then there exists an ideal I of 
R such that M = M n (/), i.e., M is the set of all nxn m, a trices with entries 
from I. 


Proof. Let I — {a E R \ a = an for some (a^-) G M}. Since 0 G M, 0 G I. 
Thus, 7 / 0 . Clearly if a, b G 7 , then a — b G 7 . Let a G 7 and r G R. Then 
a = an f°r some i a ijEij G M. Since M is an ideal, 


arEu = E n {^2 o-ijE^rEn G M. 
i,j-1 
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Hence, ar € /. Similarly, since 


raEu = rE u (^2 a-ijE^Eu £ M, 
i,j=1 

ra E /. Thus, / is an ideal of i?. We now proceed to show that M = M n (I). 
Let a ijEij € M. Let 1 < k, l < n. Now 


^ a klE\\ — E\k{ 33 a ijEij)En E M. 

i,j =i 

Therefore, E / for 1 < &, / < n. This implies that a ijE 10 E M n (I). 

Thus, M C M n (I). Conversely, let Y/ij=ibijEij E M n (I). Let 1 < A :,l < n. 
Then = cn for some ^b =1 <hjE l j E M. Since M is an ideal, 


fr/d-E’w — cuEki = Ek i( c ijEij)Eu E M. 

*d=i 

Therefore, E t j E Af. Thus, M n (I) C M. Consequently, M = M n (I). ■ 

Corollary 20.1.2 Let R be a ring with 1. If R is simple, then M n (R ) is 
simple. ■ 

Theorem 20.1.3 Let R be a ring with 1. Let I be an ideal of R. Then 

M n (R) / M n (I) ~ M n (R/I ). 

Proof. Define / : M n (R ) —» Af n (R/I) by 

f(( a ij )) ~ T /) 

for all (o^) E M n (R). Then it can be easily verified that / is an epimorphism 
and Ker / = M n (I). Hence, 

M n (R)/M n (I) ~ M n (R/I ). ■ 

In the next theorem, we describe the center of M n (R) when R is a commu¬ 
tative ring with 1. 

Theorem 20.1.4 Let R be a, comm.uta.tive ring with 1. Then C(M n (R )), the 
center of M n (R), is the set of all scalar matrices in M n (R). 
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Proof. Let E£=i aE kk , Yaj= i £ M n (R). Then 


TL Tl 71 71 

i^^ciEki c)( 2 y bijEij) — y db k jE k j ~ ^ ] db TS E TS 

k—l i,j =1 /c,j = l r,5=1 


and 


since H is commutative. Therefore, 


Xyfc,7=l bi,k a Fik 
Er,s=l b rs dE TS 
Et-,s =1 ®brsE r s 


Tl TL 

( dEkk) ( bijFij ) 


fc=i hj=i 


n n 

( ^ b^Eij)dEkk)- 


i,j=1 fc=l 


Thus, £Li aEfcfc € C(M n (H)). 

Now, let g = J2i,j=i a ijEij <E C(M n (R)). Let E k k € M n (R ), 1 < k < n. 
Then 


n 


Ekk(Y. 

i,j=l 


n 

dij Eij) ( y dijEij)E k k 
i,j =1 


implies that Y^^\ a kjEkj = Y7i=i a ikEik and hence by comparing the corre¬ 
sponding entries, we get a**, — 0 = a k j for all i,j = 1 ,2,..., n, i ^ A:, j ^ k, 
1 < k < n. Thus, all entries in g are zero except (possibly) the diagonal entries. 
Hence, g = EaU l a kkE k k- Let E rs G M n {R) be such that r / s, 1 < r, s < n. 
Then 

n n 

( y ' ^kkEkk)E rs — E rs ( a k k E k k) 

fc=1 fc=l 


implies that a TT E TS = a ss E rs . Therefore, d rT = a ss , r ^ s, 1 < r, s < n. 
Consequently, g is a scalar matrix. ■ 


20 .1.1 Worked-Out Exercises 

<2 Exercise 1 Let H be a ring with 1. Let A and H be ideals of H. Show that 
M n (AH) = M n (A)M n (H). 

Solution: Let Eij =i c 7 j E^ G M n (AH). Then G AH. Let 

S 

Cjj — y ( a iibi t G AH, 
z=i 

di t G A, bi t G H, 1 < / < s. Then 

c ijEij — ( Ez=i diMEij 

— Ez=iKA^) 

= GM n (A)M n (H). 
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Thus, YA,j=\ c ijEij € M n (A)M n (B ) and so M n (AB ) C M n {A)M n (B). Let 
CLjjEjj E M n (A~) and bijE t j E Af n (S). Let i cgicbicj, 

1 < < n. Then c 2 j = a ikbkj E AB, 1 < f, j < n. This implies that 

n n n 

( ^2, bijEij) = CijEij E M n (AB). 

\ hj=i M=1 *.J=1 

Thus, M n (A)M n (B ) C M n (AB). Consequently, M n (AB ) — M n (A)M n (B). 

<0 Exercise 2 A ring i? is called a prime ring if {0} is a prime ideal. Prove 
that a ring R with 1 is a prime ring if and only if M n (R ) is a prime ring. 

Solution: Suppose R is a prime ring and M n (R) is not a prime ring. 
There exist nonzero ideals P and Q of M n (R) such that PQ = the zero ideal 
of M n {R). There exist nonzero ideals A and B of R such that P — M n (A ) and 
Q — M n (B). Thus, PQ = M n (A)M n (B ) implies that AB — {0}. Since R is a 
prime ring, A = {0} or B — {0}, which is a contradiction. Hence, M n {R) is 
a prime ring. Conversely, suppose that M n {R ) is a prime ring and R is not a 
prime ring. Thus, there exist nonzero ideals A and B of R such that AB = {0}. 
Then M n (A), M n (B) are nonzero ideals of M n (R ) such that M n (A)M n (B ) = 
the zero ideal of M n (i?), a contradiction. Hence, R is a prime ring. 

20.1.2 Exercises 

1. If R is a field, find all ideals of M n {R). 

2. If R is a Noetherian ring, prove that M n (R ) is a Noetherian ring. 

3. If R is a ring with 1 , prove that M n (R)[x] ~ M n (R[x\). 


20.2 Rings of Triangular Matrices 


Let A,B, and C be rings with identity such that cEa is a unital bimodule, 
i.e., B is a unitary right A-module, B is a unitary left C-module and for all 
a E A, for all b E B, and for all c E C, c(ba ) = ( cb)a . 

Let 


R = 


a 0 
b c 


a E A, be B, ceC 


1 

" A 

0 " 

r 

B 

C 


Define + and • as the usual matrix addition and multiplication, i.e., if 
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a' 0 


L/ , £ R, then 

b c 


a 0 
b c 

a 0 
b c 


- J 

p 

0 * 


b' 

d 


-r 

P 

o 

_ 1 


i- 

o- 

d 



a -f~ o! 0 
6 + 6' c + d 

act! 0 
bo! + cb' cd 


Since bo! + cb' £ -B, +- and • are well defined. It is easy to check that ( R , +, -) 
is a ring. 

Consider A©B, the (external) direct sum of the rings A and B. Let (a, b) £ 
A © B and a' £ A. Define ( a,b)a' = ( aa',ba'). Then A © B is a unital right 
A-module. 

We now proceed to describe all right ideals of R. 

Let I be a right ideal of R. Let 


a 0 


M = < (o, b) £ A © B | ^ c € I, for some c £ C 


K = < c £ C 


a 0 
b c 


£ I, for some a £ A, b £ B > . 


We now claim the following: 

(i) M is a right A-submodule of A © B\ 

(ii) K is a right ideal of C; 

(hi) {0} 0 KB C M. ~ 

(i): Clearly M ^ <fi and (M,+) is an Abelian group. Let (a, b) £ M 


and a' £ A. Then f ^ £ I for some c £ C. Now °! ^ £ I and 

be be 


a' 0 

0 0 


£ R. Since I is a right ideal of R, 


aa! 0 
ba' 0 


a 0 a' 0 

be 0 0 


Thus, ( a,b)a' = (aa',ba') £ M. It is now easy to verify that M is a right 
A-submodule of A® B. 

(ii): Clearly K ^ <p y and (if, +) is an Abelian group. Let k £ K and c £ C. 
Then ^ ^ £ I for some a £ A, b £ B. Since I is a right ideal of R, 


b k 


0 0 

0 kc 


a 0 0 0 

b k 0 c 


7^a^Lc#fta£uiaZ PAy.i.ZcJ. 



20.2. RINGS OF TRIANGULAR MATRICES 


439 


Thus, kc E K. Hence, K is a right ideal of C. 

a 0 


(iii): Let k E K, y E B. Then 
is a right ideal of R , 


b k 


E I for some a E A, b E B. Since I 


0 0 " 


a 0 


' 0 0 " 

ky 0 


b k 


y o 


Hence, (0, ky) E M. Thus, {0} © KB C M. 

Conversely, let J\lT and K be defined as in (i), (ii), and (iii). Let 


7 = 


a 0 
b c 


E R | (a, b) E M, c E K 


Let 


a 0 
b c 


e 7 , 


x 0 

y z 


E R. Then 


a 0 


x 0 


ax 0 

b c 


y z 


bx + cy cz 


Now (a, b) E M, x E A. Since M is a right A-submodule of A © B, ( ax,bx ) = 
(a,b)x E M. Now c E K and y E B. Hence, (0,cy) E {0} © KB C M. Thus, 
(ax, bx + cy ) = (ax, 6x) + (0, cy) E M. Since K is a right ideal of C, cz E K. 


Therefore, 

a 0 
b c 


x 0 

y z 

— 

ax 0 

bx + cy cz 

L_ _J 1— _ 

verified that 7 is a right ideal 

of R 

. 


E 7. Now it can be easily 
We summarize the above discussion in the following theorem. 


Theorem 20.2.1 Let R,A,B, and C be defined as above. Let M be a right 
A-subm,od,ule of A® B, K be a right ideal of C, and {0} © KB C M. Let 


1 = 


a 0 
b c 


E R | (a, b) E M, c E K 


Then I is a right ideal of R. Conversely, let I be a right ideal of R. Then there 
exists a right A-submodule M of A® B and a right ideal K of C such that 
{0} © KB C M and 


I = 


a 0 
b c 


E R | (a,b) 6 M, c E K 


The following theorem, which is dual to the above theorem, can be proved 
in a similar manner. We leave its proof as an exercise. 
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Theorem 20.2.2 Let R,A,B, and C be defined, as above. Let M be a left 
C-submodule of B © C, K be a left ideal of A, and BK 0 {0} C M. Let 


I = 


a 0 
b c 


£ R | (6, c) G M, a £ K 


Then I is a left ideal of R. Conversely, let I be a left ideal of R. Then there exists 
a, left A-submodule M of B®C and a left ideal K of A such that BK(&{ 0} C M 
and 

£R I (fc,c) GM, aeifl.l 



Let M and N be right R-modules and / : M —> N. Then / is called a 
R- homomorphism if (i) /(a + b) = /(a) + f(b) and (ii) f(ar) — f(a)r for 
all a, b £ M, r £ R. If / is a one-one function from M onto N and / is a 
R-homomorphism, then / is called an R-isomorphism or simply an isomor¬ 
phism from M onto N. M and N are isomorphic as right R-modules, if 
there exists an R-isomorphism from M onto N. Similar conventions hold for 
left R-modules. 


Theorem 20.2.3 Let R, A, B , a,nd C be defined as above. Let I = 


0 0 
B 0 


and J = 


A 0 
0 0 


Then the following assertions hold. 


(i) I is an ideal of R. ^ 

(ii) R/I ~ A 0 (7, where A 0 C is the (external) direct sum, of rings. 

(Hi) I is a. Noetherian (Artinian) right R-module if and, only if B is a, 
Noetherian (Artinian) right A-module. 

(iv) I is a, Noetherian (Artinian) left R-module if and only if B is a, Noethe¬ 
rian (Artinian) left C-m,odule. 

(v) J is a right ideal of R. 

(vi) Rj J ~ B 0 C as a right R-m,od,ule. 


Proof. (i) Let M — {0} 0 B and K — {0}. Then by Theorems 20.2.1 and 
20.2.2, / is an ideal of R. 

(ii) Define / : R —» A 0 C by 


/ 


a 0 
b c 


(a,c) 


for all 


a 0 
b c 


£ R. Clearly, / is an epimorphism. Now 


a 0 
b c 


£ Ker / if 


and only if / 


a 0 
b c 


= (0,0) if and only if (a, c) = (0,0) if and only if 
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a = 0 and c = 0. Thus, Ker f = I. Hence, 


R/I~ A® C. 


(iii) Suppose I is a Noetherian right 72-module. Let B\ C 7? 2 Q • ■ ■ be an 
ascending sequence of right A-submodules of B. Let 


R = 


0 0 
Bi 0 


z = 1,2, 


Then R is a right ideal of R and R C 7 i+1 C 7, i = 1,2,.... Thus, R C R C • • • 
is an ascending sequence of right 72-submodules of I. Since I is a Noetherian 
right 72-module, there exists a positiv^ integer n such that I n — I n+ k for all 
k > 0. Therefore, B n = B n+ *. for all k > 0. Hence, B is a Noetherian right A- 
module. Conversely, let B be a Noetherian right A-module. Let R C R C • • • 
be an ascending sequence of right i?-submodules of I. Let 


B t = 


b £ B 


0 0 
b 0 



* = 1 , 2 ,.... 


Clearly (B^,+) is an Abelian group. Let b £ Bi and a £ A. Then 


0 0 
b 0 


/, and 


a 0 
0 0 


q | £ R. Since R is a right 72-module, 

£ R. Hence, ba £ B t . Thus, B l is a right A-submodule of B,i = 


0 

0 ' 


’ 0 

0 ’ 

ba 

0 


b 

0 


1,2,-Clearly B z C B l+ i,i = 1,2,-Therefore, B\ C B 2 C ... is an 

ascending sequence of right A-submodules of B. Since B is a Noetherian right 
A-module, there exists a positive integer n such that B n = B n+ k for all k > 0. 
Thus, I n — 7 n+ fc for all k > 0. Hence, 7 is a Noetherian right A-module. 

(iv) The proof of this part is analogous to the proof of part (iii). 


(v) Let 


a 0 
0 0 


b 0 
0 0 


£ J and 


a 0 
b c 


£ 72. Then 


a 

0 " 


’ b 

0 ' 


a — b 

0 ' 

0 

0 


0 

0 


0 

0 


and 

a 0 

0 0 

Thus, J is a right ideal of 72 and hence a right 72-submodule. 



1 

0 

_j 


aa' 0 


b' c' 


1 

O 

O 

1 _ 
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(vi) Clearly R / J is a right iLmodule. Let (6, c) E B®C and 
Define a binary operation • 


a' 0 
b' c' 


E R. 


(6,c) 


a' 0 

y y 


( ba' + cb ', cc'). 


Clearly {ba! + cb',cc r ) E B ® C. It can now be easily checked that B 0 C is a 
right i?-module under •. Define 


/ : R-^ B® C 


by 


/ 


a 0 
b c 


for all 


a 0 
b c 


E R. Let 


a 0 
b c 


a 


' 0 


y 


= c ) 

E iL Then 


/ 


( 

a 

0 ’ 


a' 0 

\ ,(\ 

{ 

b 

c 

+ 

b' c' 

) = ' 


= / 


<2 ~t~ a 0 
b + b' c + c' 
(b + 6',c + c') 

(6, c) + (6', c') 
a 0 
b c 


+ / 


a' 0 


y 


Now 


/ 



a 

0 ’ 


a' 0 

\ J\ 

l 

6 

c 


r 

o- 

Ci 

1 _ 

) - nl 


aa! 0 
ba! + cb' cc' 
= {ba' + cb', cc') 
a' 0 


= 0: c) 

= / 


b' c' 

a 0 
6 c 


a' 0 
c' 


y J 


Hence, f is an iThomomorphism. Clearly / is onto B © C and Ker / 
Consequently, 

R/J -B&C.M 


= J. 


20 .2.1 Worked-Out Exercises 


<0 Exercise 1 Let R = 


a 0 
b n 


a, b E Q, n E Z > = 



Q 0 

r 

Q Z 


(i) Find all right ideals of R. 

(ii) Show that R is right Noetherian, but not left Noetherian. 
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Solution: (i) Let J be a right ideal of R. Suppose J ^ R. By Theorem 
20.2.1, there exists a right Q-submodule M of Q® Q, a right ideal K of Z such 
that {0} © KQ C M, and 


/ = 


a 0 
b c 


E R | (a,b) E M, cEK 


Since Z is a PID, K = (n) for some n E Z. Now as in Example 19.1.7(h), 
we can show that M is of the following form: 

(1) M = {0}, or 

(2) M = {0} 0 Q, or 

(3) M = Q © {0}, or 

(4) M = ((a, b)) for some a,6 G Q, a / 0, 6 / 0, or 

(5) M = Q0Q. 

Case 1. M = {0}. Since {O}0/STQ C M, K = {0}. Therefore, in this case, 
Case 2. M = {0} 0 Q = ((0,1)). In this case, 


i.e., J is a principal right ideal of R. 

Case 3. M — Q 0 {0}. Since {0} ® K Q C M, K = {0}. Therefore, in this 
case, 

r 0 0 
1 0 


J 


R, 


i.e., J is a principal right ideal of R. 

Case 4. M = ((a, b )) , for some a, b E Q, a ^ 0, b ^ 0. Since {0} 0 KQ C 
M, (0,n) E M. Thus, (0,n) = u(a,b) for some u E Q. Hence, ua = 0 and 
7i = ub. Since a ^ 0, ua = 0 implies that u — 0 and hence 77 , = 0. Thus, 
K = {0}. Therefore, in this case, 


J = 


a 0 
6 0 


R, 


i.e., J is a principal right ideal of R. 
Case 5. M = Q 0 Q. 


Case 5a. K ^ {0}. Then n ^ 0. Let 
c = nm for some m E Z. Now 


a 0 
b c 


E J. Then c E K and so 


’ a 0 ' 


" 10 " 


a 0 

b c 


1 n 


^ 777 . 
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Hence, ^ 

b c 

C J. Therefore, 


1 0 
1 n 


R. Thus, J C 


1 0 
1 n 


1 0 
1 n 


R. Clearly 


1 0 
1 n 


Consequently, J is a principal right ideal of R. 

/-I_CU TS __ fn’l TL - T _ . f Q ^ ! 


Case 5b. K = {0}. Then J — 


Q 0 


In this case, J is generated by 


1 0 
0 0 


0 0 
1 0 


, i.e.j J is finitely generated. 


(ii) Let / — 


0 0 

Q 0 


. Then I is an ideal of R. Now Q is a left Z-module. 


Let A k = , k is a positive integer. Then Ak is a left Z-submodule of Q. 

Since ^fpt £ A k, M C A M . Thus, 


'4iCd 2 C-c4c A k+l C 


is a strictly ascending sequence of left Z-submodules of Q. Hence, Q is not a 
Noetherian left Z-module. Therefore, by Theorem 20.2.3, I is not a Noetherian 
left i?-module. Thus, R is not left Noetherian. 

By (i), if J is a right ideal of R , then J is a finitely generated. Since every 
right ideal of R is finitely generated, R is right Noetherian. 


20.2.2 Exercises 

1. Consider the ring R — 

(i) Find all left ideals of R. 

(ii) Show that R is left Noetherian, but not right Noetherian. 

2. Consider the ring R — 

(i) Find all left ideals of R. 

(ii) Show that R is left Artinian, but not right Artinian. 
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Chapter 21 

Field Extensions 

* 


In this chapter, we study a special type of ring called a field. Results about 
fields have applications in number theory and the theory of equations. The 
theory of equations deals with roots of polynomials. It is here that our main 
interest lies. This interest leads us to an introduction of Galois theory. 

The importance of the concept of a field was first recognized by Abel and 
Galois in their research on the solution of equations by radicals. However, the 
formal definition of a field appeared more than 70 years later. The works of 
Dedekind and Kronecker seem to be responsible for the entrance of the concept 
of a field into mathematics. However, in 1910, in his paper, Algebraic Theorie 
der Koperer , Steinitz gave the first abstract definition of a field. His work freed 
the concept of a field from the context of complex numbers. 

21.1 Algebraic Extensions 

Let us recall that the characteristic of a field F is either 0 or a prime p. By 
Theorem 11.1.9, the intersection of any collection of subfields of a field F is 
again a subfield of F. Hence, a field contains a subfield which has no proper 
subfield, namely, the intersection of all its subfields. 

Definition 21.1.1 A field F is called a prime field if F has no proper sub¬ 
field. 

Theorem 21.1.2 Let F be a field. 

(i) If the characteristic of F is 0, then F contains a subfield K such that 
AT ~ Q. 

(a) If the characteristic of F is p > 0, then F contains a subfield K such 
that K ~ Z p . 
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Proof. Define / : Z — >F by 

f(n) = nl 

for all n E Z, where 1 denotes the identity of F. Then / is a homomorphism. 

(i) Suppose the characteristic of F is 0. Then Ker / = {0} and so / is 
one-one. Define /* : Q -* F by 

n- b ) = mm - 1 

for all | E Q. Let |, ^ E Q- Now | ^ if and only if ad = be if and only if 

f(ad) = f(bc ) if and only if f(a)f(d ) = f (c)f(b ) if and only if f(a)f(b)~ l — 
f{c)f(d)~ l if and only if /*(§) = /*(§). Hence, /* is a one-one function. Now 

/*(? + §) = /*(*£“) 

= f(ad + bc)f(bd) 1 

= (/(o)/w+ fwwm-'m - 1 
= /( o )/(6) 1 + /( c )/(<0 1 
= /*(!) + /*(§)• 

Also, 

r(f-s) = r(g) 

= f(ac)}(bd) 1 

= /(a)/(c)/(6)- 1 /(d)- 1 

= mm i f r .c)f(d ) 1 
= /*(!)/*(!)- 

Thus, /* is a homomorphism. Hence, Q ~ Z(/*), where Z(/*) is the image of 
/*.Let K = I(f*). 

(ii) Suppose the characteristic of F is p > 0. Now 

Z/Ker / ~ Z(/). 

Since the characteristic of F is not zero, Z(/) ^ {0}. Therefore, T{f) is a non¬ 
trivial subring with 1 of the field F. Consequently, T(f) is an integral domain 
and so Z/Ker / is an integral domain. This implies Ker / is a prime ideal of 
Z and Z ^ Ker /. There exists a prime g such that Ker / = gZ. Now gl = 0 
implies that p|g and so q = p. Hence, Z/Ker / ~ Z p . ■ 


Let L be a subfield of Q. Since F\{0} is a subgroup of Q\{0} under multi¬ 
plication, 1 E L. Hence, Z C L and so Q CL. Thus, Q has no proper subfield. 
Similarly, Z p has no proper subfield, where p is a prime. 

Thus, the subfield K of the field F in Theorem 21.1.2 is the prime subfield 
of F. 

The following theorem can be easily verified. We leave its proof as an 


exercise. 





21.1. ALGEBRAIC EXTENSIONS 


447 


Theorem 21.1.3 Let F be a field and K be a subfi,eld of F. The following 
conditions are equivalent. 

(i) K is the prime subfi,eld of F. 

(ii) K is the intersection of all subfields of F. ■ 

Let F be a field and K a subfield of F. The field F is called an extension 
of the field K. We express this by F/K and call F/K a field extension or an 
extension field. 


Definition 21.1.4 Let F/K be afield extension and C be a subset of F. Define 
K{C) to be the intersection of all subfields of F which contain K UC. Then 
the subfield K{C) of F is called the subfield of F generated by C over K. C 
is called a set of generators for K(C)/K. 

Let K[C\ be the smallest subring of F containing KuC. Since any subfield 
of F which contains K U C must contain K[C), we have that K{C ) equals 
the intersection of all subfields which contain K[C). Now K[C] is an integral 
domain since it is a subring (with identity) of a field. Thus, by Theorem 12.1.6, 

K{C) = {ab~ l | a, b E K[C], bfiO). 

That is, K(C ) is the set of all rational expressions of the elements of K[C\. 
Hence, K(C) is a quotient field of K[C). 

Let F/K be a field extension and ci,C2,... ,c n E F. Considering Definition 
21.1.4, it follows that K(c\, C2, ..., Cn) = K(c\, C2, ..., c n _i)(c n ). Recall that 
K(c\) = {ab~ l | a, b E K[c\\, b 7^ 0 } ; 

Definition 21.1.5 Let F/K be a field, extension. An element a E F is said to 
be algebraic over K if there exist ko, k\, ..., k n E K, not all.zero, such t,ha,t 
ko T k\a+ ■ ■ ■ + k n a n = 0; otherwise a is called transcendental over K. 

Let F/K be a field extension and let qEF. Then a is algebraic over K if 
and only if a is a root of a nonzero polynomial with coefficients from K. 

Example 21.1.6 The element y/2 in R is algebraic over Q since y/2. is a, root 
of x 2 — 2 E Q[x]. The element i E C is algebraic over R and Q since i is a 
root of x 2 + 1 E Q[x]. 


Example 21.1.7 It can be shown that re, e E R are transcendental over Q. In 
the quotient field F(x) of the polynomial ring F[x], F a field, x is transcendental 
over F since ^=0 a i x% ~ 0 if an d on ^y a i — 0 for i = 0,1,..., n. 


Theorem 21.1.8 Let F/K be afield extension and c E F. Then c is algebraic 
over K if and only if c is a root of som.e unique irreducible monic polynomial 
p(x) over K. 
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Proof. Suppose c is algebraic over K. There exists a nonzero polynomial 
f{x) G K[x\ such that c is a root of f{x) and f(x) £ K. By Theorem 16.1.15, 
there exist irreducible polynomials f\(x) t / 2 (x),...,/ m (x) G K[x\ such that 
f{x) = h(x)f 2 {x) - ■ • fm(x). Thus, 

0 = /(c) = /l(c)/ 2 (c) • • • /m(c). 

Since F has no zero divisors, we must have fife) — 0 for some i. Thus, there 
exists an irreducible polynomial h(x) — 6o + ^i^+ ■ • • + 5 m x m , b m 0 , such that 
h(c ) = 0. Let p(x) — b^fh(x). Then p{x) is an irreducible monic polynomial in 
K[x] with c as a root. 

Let g(x) be any polynomial in K[x\, which has c as a root. Let p{x) be 
a monic polynomial of smallest degree in AT[x], which has c as a root. There 
exist q{ x), r(x) G K{x\ such that g{x ) — q{x)p{x) + r(x), where either r{x) = 0 
or degr(x) < degp(x). Now 

0 — g(c) — q{c)p(c ) + r(c) — q{c) • 0 + r(c). 

Thus, r(c) = 0, whence r{x) — 0 else we contradict the minimality of the de¬ 
gree of p(x). This implies that p{x)\g{x) in K{x\. Let s(x) be any irreducible 
polynomial in K[x], which has c as a root (one such polynomial is fi(x) for 
some i, 1 < i < m). Then p(x)[s(x). Now p(x) is not a constant polynomial in 
K[x] since it has c as a root. Thus, since s(x) is irreducible in K[x\, p(x) must 
be irreducible in K[x\. Also, p(x) = ks(x) for some k E K. If we choose s{x) 
monic, then k = 1 and so we have the desired uniqueness property of p{x). The 
converse is immediate. ■ 

The proof of Theorem 21.1.8 yields the next result. 

Corollary 21.1.9 Let FJK be a field extension and c € F be such that c is 
algebraic over K. Then the unique monic irreducible polynomial p{x) over K 
having c as a root satisfi.es the following properties: 

(i) There is no polynomial g{x) G K[x] having smaller degree thanpfx) and 
which has c as a root. 

(n) tfc is a root of som.e g(x) G K[x\, then p{x)\g(x) in K[x\. ■ 

We call the polynomial p(x), in Corollary 21.1.9, the minimal polynomial 
of c over. K. The degree of p(x) is called the degree of c over K 

Example 21.1.10 By Examples 21.1.6,15.3.6, and 15.3.7, we have that x 2 — 2 
is the minim,a,l polynomial of y/2 over Q and x 2 + 1 is the minimal polynomial 
of i over R. 
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Theorem 21.1.11 Let F/K be a field extension and c G F. 

(i) If c is transcendental over K , then K(c ) ~ K(x), where K(x) is the 
quotient field of the polynomial ring K[x]. 

(ii) If c is algebraic over K , then K[c] ~ K[x}/ {p(x)) , where p{x) is the 
m,inim,al polynomial of c over K. 

Proof. Define the mapping a : K[x} —> K[c\ by for all f(x) G K[x], 

<*(/(*)) = /( c )- 

Then by Theorem 14.1.14, a is a homomorphism of K[x\ onto K[c\. Thus, 

K[x\/ Ker a ~ K[c\. 

(i) Now f[x) G Ker a if and only if /(c) = 0, i.e., if and only if c is a root 
of /(x). Hence, Ker a = {0} if and only if c is transcendental over K. Thus, c 
is transcendental over K implies a is an isomorphism of K[x\ onto K[c\ and so 
by Exercise 5 (page 324), a can be extended to an isomorphism of K{x ) onto 
K{c). Consequently, if c is transcendental over K, then K(x) ~ K(c). 

(ii) Suppose c is algebraic over K. Since K[x\ is a principal ideal domain, 
there exists g{x) G K[x\ such that Ker a = (g{x )) . Now a(g(x)) — g(c) — 0. 
Hence, c is a root of g{x). Thus, p(x)\g(x) and so there exists q{x) G K[x\ such 
that g(x) — q{x)p(x). This implies that g{x) G (p(z)) anc ^ so 

Ker a = (g(x)) C (p(x)) . 

Since p(c) — 0, p(x) G Ker a. Therefore, {p(x)) C Ker a. Consequently, Ker 
a = (p(x)). ■ 

Corollary 21.1.12 Let F/K be a. fi.eld, extension a,nd, c G F. Then 

(i) K[c\ C K(c) if and only if c is transcendental over K , 

(ii) K[c] — K(c ) if and only if c is algebraic over K. 

Proof. Since K[c] C K(c) always holds, (i) and (ii) are equivalent state¬ 
ments. Hence, we show that (ii) holds. Suppose c is algebraic over K. Then 
by Theorem 21.1.11, 

K[c\ ~ K[x]/ {p(x)) 

and since p(x) is irreducible, K[x]/ (p(x)) is a field. Thus, K[c ] = K(c). Con¬ 
versely, suppose K[c\ = K(c). If c = 0, then c is the root of the polynomial 
x G K[x\. Suppose that c ^ 0. Then c~ l G K(c ) and so c~ l — Uq + k\c-\- 

-(- k n c n for some k z G K. This implies that 0= — 1 -f fcoc+ Aqc 2 H-1- k n c n+l 

and so c is algebraic over K. ■ ' 
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Let F/K be a field extension. Under the field operations of F, F can be 
considered as a vector space over K. The elements of F are thought of as 
“vectors” while those of K are thought of as “scalars.” Recall that (F, +) is a 
commutative group and that for all k\,k 2 E K and 01,(22 E F, k\(a\ + 02 ) = 
k\a\ + kia, 2 , ( k\ + k 2 )ai = k\a\ + 1 hold from the distributive laws and that 

{k\k 2 )a\ = ki(k 2 di) holds from the associative law of multiplication. 

Definition 21.1.13 Let F/K be a field extension. The dimension of the vector 
space F over K is called the degree or dimension of F/K and is denoted 
by [E : K]. If the dimension of F/K is finite, then F/K is called a finite 
extension. 

Theorem 21.1.14 Let F/K be afield extension and c E F be algebraic over 
K. Let p{x ) be the minimal polynomial of c over K. If deg p(x) — n, then 
{1, c, c 2 ,..., c 71-1 } is a basis of K(c)/K. 

Proof. By Corollary 21.1.12, K[c] = K{c). Let g(c) E K[c\ and g{x) be 
the corresponding element in K[x\. There exist q(x), r{x) E K[x] such that 
g(x) = q(x)p(x) + r(x), where either r(x) = 0 or deg r(x) < degp(x). Thus, 
g{c) = g(c)p(c)+r(c) = r(c). Hence, {1, c, c 2 ,..., c 71-1 } spans K(c)/K. Suppose 
0 = Y/Tq kiC z , ki E K. If the kfs are not all zero, then c is a root of a polynomial 
of degree < n — 1 < n, a contradiction. Thus, = 0 for i = 0 , 1, .. . , n — 1 and 
so {1, c, c 2 ,..., c n_1 } is linearly independent over K. Hence, {1, c, c 2 ,..., c n_1 } 
is a basis of K{c)/K. ■ 

Corollary 21.1.15 Let F/K be a fi.eld extension. If c £ F is algebraic and of 
degree n over K , then [K{c) : K) — n. ■ 

Example 21.1.16 The field extension Q(\/2)/Q is of degree 2 and {1, \/2} 
is a basis of Q(\/2) over Q since p(x) = x 2 — 2 is the minim.al polynomial of 
\[2 over Q by Example 21.1.10. Thus, Q(\/2) = {a 4- b\/2 | a, b E Q}. 

The student may recall from another mathematics course that a + by/2 = 
c + dyf 2 if and only if a = c and b = d, where a,b,c,d € Q. This becomes clear 
now since 1 and y/2 are linearly independent over Q by Theorem 21.1.14. 

Example 21.1.17 By Theorem 21.1.14 , the field extension R(i)/R is of degree 
2 and. {1, i} is a basis of R(i) over R since p(x) = a; 2 + 1 is the minim,al 
polynomial of i over R. Thus, R(z) = {a + bi I a, b E R}. Hence, we see tha.t 
R(z) is C, the field of complex numbers. 

Theorem 21.1.18 Let F/K be a finite field extension. Then every el.em.ent 
of F is algebraic over K. 
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Proof. Let n be the dimension of F/K. Let c E F be such that c ^ 0, 
1. (Clearly 0 and 1 are algebraic over K.) If the set {1, c, c 2 , ..., c n } does 
not contain n +1 distinct elements, then c J ~ l — 1 for some i, j (0 < i < j < n ) 
and so c is a root of — 1. Suppose l,c,c 2 ,... ,c n are distinct. Then they 
must be linearly dependent since they are more in number than the dimension 
of the vector space F over K. Hence, there exist fco, k\, ..., k n E K not all zero 
such that 0 = 0 Thus, c is a root of the polynomial k l x l over K. ■ 

The converse of Theorem 21.1.18 is not true, that is, it is not necessarily 
the case that if every element of F is algebraic over K , then F/K is a finite 
field extension. It can be shown that the set of all elements A of R, which are 
algebraic over Q is a field such that [A : Q] is infinite (Theorem 21.1.22 and 
Example 21.1.25). A is called the field of algebraic numbers. 

Theorem 21.1.19 Let K{c) / K be a field extension. Then K(c)/K is finite if 
and only if c is algebraic over K. 


Proof. If K{c)/K is finite, then c is algebraic over K by Theorem 21.1.18. 
If c is algebraic over K, then K[c)/K is finite by Corollary 21.1.15. ■ 

Let F/K be a field extension. A subfield L of F is called an intermediate 
field of F/K if K C L C F. Since a — b € L for all a, b E L and ka E L for all 
k G K and a G L, it follows that L is a subspace of F over K. An intermediate 
field L of F/K is called proper if L F. 

Theorem 21.1.20 Let F/K be a, fi,eld, extension a,nd L be an intermediate 
field of F/K. Then 

[F:K} = [F:L][L:K]. 

Moreover, F/K is a finite extension if and only if F/L a,nd, L/K are finite 
extensions. 


Proof. Let V be a basis of F/L and U be a basis of L/K. We show that 

W = {uv | u G U, v G V} 

is a basis of F/K. Let c E F. Since V is a basis of F/L, there exist v\, ..., v n E 

V and ci, C 2 ,..., c n G L such that 


n 


c = Y^CjVj- ( 21 . 1 ) 
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Since U is a basis of L/K , there exist u\, U 2 ,..., u m E U and k\j,k 2 j,..., 
kjnj E K such that 

m 

Cj = ^ ^ kijUi , j = 1, 2 ,..., n. (21.2) 

i —l 

Substituting Eq. (21.2) into Eq. (21.1), we obtain 

n m 

c = kijUiVj. 

j=li=l 

Thus, W spans F over K. Suppose 

n m 

o = Y^Y,kn u iVj, 

j=1 i=1 

where z^ E U, Vj E H, and k t j E JT for all z = 1,2,..., m; j = 1, 2,..., n. Then 

n m 

o = ( y: kij u i ) y j 

j=1 i= 1 

and since V is linearly independent over L, 

m 

0 — ^ kij Ui , j — 1, 2 ,..., n. 
i=1 

Thus, k^ = 0 for z = 1, 2, ..., m\ j = 1, 2, ..., n since U is linearly independent 
over K. Hence, W is linearly independent over K , whence W is a basis of F 
over K. Let u,u' E U and v,v' E V. If v ^ v', then uv ^ u'v 1 since v and v' 
are linearly independent over L. If v = v', then uv = u'v' if and only if u — u'. 
Consequently, for all u,v! E U and for all v,v' E V if either u u' or v ^ v', 
then uv ^ u'v'. Hence, [F : K] = \U x V\ = \U\\V\ = [F : L\[L : K\. Now if 
either U or V is infinite, then W is infinite. If U and V are finite sets, then W 
is a finite set. Hence, F/K is a finite extension if and only if FjL and L/K 
are finite extensions. ■ 

Example 21.1.21 Consider the field, extension Q(\/2, \/3)/Q- By Example 
21.1.10, x 2 — 2 is the minimal polynomial of \/2 over Q. Also, x 2 — 3 is the 
minimal polynomial of \J 3 over Q(\/2). (Thai x 2 — 3 is irreducible over Q(\/2) 
follows by an argument that is similar to the one used in Worked-Out Exercise 
l, page 454 J Thus, {l,%/2} is a basis of Q(v / 2)/Q a,nd {l,\/3} is a, basts of 
Q(\/2, V / 3)/Q(\/2). By Theorem, 21.1.20, (1, \/2, \/3, \/6} is a basis of Q(\/2. 
V3)/Q. [Q(V2, V3) : Q) = 4, [Q(V2, V3) : Q(V2)] = 2, and [Q(v/2) : Q] = 2. 

Theorem 21.1.22 Let F/K be a field extension. If L is the set of all elements 
in F, which, a,re algebraic over K , then L is an interm,ediaie fi,eld of F/K. 
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Proof. Any k € K is a root of the polynomial x — k over K. Thus, L D K. 
Let a and 6 be elements of L, where a is of degree m over K and b is of degree 
n over K. Then K(a)/K is of degree m. and K(a, b)/K(a ) is of degree at most 
n. Hence, by Theorem 21.1.20, K(a,b)/K is a finite extension. By Theorem 
21.1.18, every element of K(a,b ) is algebraic over K. Since a — b and ab~ l (for 
b fi 0) are elements of K(a, 6), a — b and a6 _1 (for 6^0) are algebraic over K. 
Thus, a — b and ab~ l (for b fi 0) E L and so L is a field. ■ 

Definition 21 . 1.23 A field extension F/K is called algebraic if every ele¬ 
ment of F is algebraic over K] otherwise F/K is called transcendental. 

Theorem 21 . 1.24 Let L be an intermediate fi.eld of the field, extension F/K. 
Then F/K is a.n algebraic extension if and only if F/L a.nd. L/K are algebraic 
extensions. 

Proof. Suppose that F/K is algebraic. Let a £ F. Then a is a root of 
a nonzero polynomial p(x) £ K[x\. Since K C L, p{x ) € L[x\. Thus, a is 
algebraic over L and so F/L is algebraic. Every element of L is an element 
of F. Hence, L/K is algebraic. Conversely, suppose F/L and L/K are alge¬ 
braic extensions. Let c £ F. Then c is a root of some nonzero polynomial 
co + c\x + ••• + c n x n £ L\x\. Thus, c is algebraic over K{cq, ci, ...,c n ) 
whence K(co, c \,..., c n )(c)/K(co, c \,..., c n ) is a finite extension. Since cq, 
Ci, ... ,c n are algebraic over K, repeated application of Theorem 21.1.20 yields 
that K(cq, ci, ..., c n )(c)/K is a finite extension. Therefore, c is algebraic over 
K by Theorem 21.1.18. Hence, F/K is an algebraic extension. ■ 

Example 21.1.25 Let F = Q({ v / p | p £ Z, p is a prime}) C R. We show 
that F/Q is algebraic and [F : Q] = oo. Now for any prime p, yfp ^ Q. 
Let pi,...,p n be any distinct prim.es. Suppose p fi pi , i = 1,2, ...,n, and 
p is a prim.e. Assum.e that yfp QCy 7 ?!, •••> y/Pn), the induction hypoth¬ 
esis. (The case n = 0 is yfp / Q and this case is described above.) We 
show that if p\,... ,p n +1 ore distinct prim.es and p fi pi, i = 1, 2, ... , n + 1, 
then yfp £ Q( v /pT,..., y/p n + 1). Suppose yfp £ Q (y/pi ,..., fip n + 1). Then there 
exist a,b £ Q(- v /pT,.. -, yfpfi) such that yfp = a + byfp n+ i. If a = 0, then 
p = 6 2 p n+ i, a contradiction since p and p n+ 1 are distinct primes. 7/6 = 0, 
then yfp = a £ Q(- V /Pi, • ■ ■, y/Pn), contradiction to our induction hypothesis. 
Suppose a/0 a.nd b fit). Then p = a 2 +p n+ ib 2 + 2abyfp n+ \. Hence, y/p n +i — 
(p - a 2 - p n+ ib 2 )/2ab £ Q{^pl,yfipfi) and so yfp £ Q{^pl,..., yfpfi), a. 
contradiction of the hypothesis. Hence, yfp Q(yfpi ,..., yfp n + 1 )- Thus, by the 
induction hypothesis, we find that for any positive integer k, if p \,... ,pk, p are 
distinct prim.es, then yfp fi Q(y / pl, • •., yffik)- Hence, 

Q C Q(v / 2) C Q(\ / 2, VS) C ■ • • 

TRilHjc. 
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is an infinite strictly ascending chain of intermediate fields of F /Q. Hence, 

F /Q must be of infinite dimension. Let a E F. Then there exist primes pi, ..., 
p n such that a E Q(y/pi, • ■ -, >/pL)- Since Q(y / pi, ..., y/fin )/Q is a finite field 
extension, a is algebraic over Q by Theorem, 21.1.18. Hence, Ff Q is algebraic. 

Note that from, this example, it follows that [R : Q] = oo. 

The above example provides us with a held extension Ff Q which shows 
that the converse of Theorem 21.1.18 is not true. Since the held of algebraic 
numbers A contains F, we have [A : Q] = oo. 

/ 

Definition 21.1.26 Let F/K and L/K be field extensions and a : F —> L be 
a, h,omom,orphism. Then a is called a K -homomorphism, if a [a) = a for all 
a E K. 

Let F/K and L/K be held extensions and a : F —*■ L be a F'-homomorphism. 

Since <r is a nonzero homomorphism, Ker <r ^ F. Therefore, Ker a = {0} since 
the only ideals of F are F and {0}. This implies that a is one-one. Hence, a 
is an isomorphism of F onto cr(F). We simply call a a FT-isomorphism of F 
into L. If L = F = a(F) and a is a FT-isomorphism of F into F, then we call 
c7 a F-automorphism. 

Theorem 21.1.27 Let F/K be an algebraic extension and a : F —> F be a 
K-homomorphism. Then a is an automorphism,. 

Proof. As above a is one-one. To show a is an automorphism, it only 
remains to be shown that cr(F) = F, i.e., a is onto F. 

Let a 6 F. Let fix') = ao + a i x + • • • + a,kx k £ FT[x] be the minimal 
polynomial of a over K. Let b be any root of f{x) in F. Then f(cr(b )) = 
ag + aicr(6) + • • • + akcr(b) k = a(ao + a\b +■■'•+ afifi) = 0. Hence, a(b) is a 
root of f(x). Let F' be the subheld of F generated by all roots of f{x) over 
FT that lie in F. Then F' jK is a hnite extension. Since o maps a root of f[x) 
to a root of f(x), a maps F' into F'. Since [F' : K] = [cr(F') : K], it now 
follows that [F' : cr(F')} = 1 by Theorem 21.1.20 and so F' = cr(F ; ). Hence, 
a E F' = &(F') C <t(F). Thus, a is onto F. ■ 

21.1.1 Worked-Out Exercises 

0 Exercise 1 Show that the polynomial x 2 — 7 is irreducible in Q(\/3)[ab¬ 
solution: Suppose x 2 — 7 = {x — (a+6\/3))(a: — (c-t-d\/3)), where a,b,c,d E 
Q. Then x 2 — 7 = x 2 — ((a + c) + [b + d)y/S)x + {ac + 3bd + ady/3 + 6c\/3)- This 
implies that 

(a + c) + (5 + d )\/3 = 0 

ac + 3bd + ady/3 -I- bc\J 3 = —7. 

“Pi 7feaiAe»tattea/ “Phy-SSLc-A. 
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Since {1, V^} is linearly independent over Q, a + c — 0 and 6 + d — 0. Hence, 

-a 2 - 3 b 2 + (—2 ab)V3 = -7. 

Thus, — a 2 — 36 2 = —7 and —2 ab — 0. Hence, ab = 0. Suppose a = 0. Then 
3 b 2 — 7. Now b — — for some integers m and n with gcd(m, n) — 1. Therefore, 
3m 2 = 7n 2 , which contradicts the fundamental theorem of arithmetic. Suppose 
6 = 0. Then o 2 = 7, which again leads to a contradiction of the fundamental 
theorem of arithmetic. Thus, x 2 — 7 is irreducible in Q(\/3)[x]. 

<£> Exercise 2 Find [Q(v / 3, \/7) : Q(v / 3)] and [Q(v / 3) : Q]. Also, find a basis 
for Q(\/3, V / 7)/Q(v / 3) and a basis for Q(\/3, a/7)/Q. 

Solution: By Worked-Out Exercise 1 (page 454), m 2 — 7 is irreducible over 
Q(\/3). Thus, 

[Q(V3, y/7) : Q(\/3)] - deg(:r 2 - 7) = 2. 

By Theorem 21.1.14, {1,^/7} is a basis for Q(\/3, \/7)/Q(\/3). Since x 2 — 3 is 
irreducible over Q, [Q(\/3) : Q] =2 and {l,\/3} is a basis for Q(\/3)/Q. Thus, 

[Q(V3, \/7) : Q] = [Q(V3, V7) : Q(>/3)][Q(V'3) : Q] = 2 • 2 = 4. 

By Theorem 21.1.20, {1, y/3ly/7, y/21} is a basis of Q(%/3, \/7)/Q. 

0 Exercise 3 Find an element u G R such that Q(\/2, y/7) — Q (u). 

Solution: We claim that u = \/2\/7. Since u = \[2y/7 G Q(\/2, \/7), 
Q(u) C Q(v / 2, yj7). Now V2?/7 G Q(n) implies that 14^2 = (v^v^) 3 G Q(n). 
Hence, \/2 G Q(u). Since \/2, \/2\/7 G Q(n), \/7 G Q(u). Therefore, Q(\/2, 
y/7) C Q(«). Thus, Q(y/2, #7) = Q(u). 

<0> Exercise 4 (i) Let F be a field and a, 6 be members of a field containing F. 
Suppose that a and 6 are algebraic of degree m and n over F, respectively. 
Suppose m and n are relatively prime. Show that [F(a, 6) : F] = mn. 

(ii) Show that the result in (i) need not be true if m. and n are not 
relatively prime. 


Solution: (i) Let f{x ) G F[x] be the minimal polynomial of a of degree 
m. Now f(x) G F[x] C F(b)[x]. Thus, a satisfies a polynomial of degree m over 
F(b). Hence, [F(6)(a) : F(6)] < m. Since F(6)(a) = F(a,b), [F(a,6) : F(6)] < 
m. Now [F(a, 6) : F] = [F(a, 6) : F{b)} [. F(b ) : F] < mn. Also, 


[F(a, 6) ; F] = [F(a,6) : F(6)][F(6) : F] = [F(a,6) : F(6)]n. 

Thus, n|[F(a, 6) : F], Similarly, m,|[F(a,6) : F]. Since m and n are relatively 
prime, mn\[F(a : b) : F]. Therefore, [F(o,6) : F] > mn. Consequently, [F(a, 6) : 
F] -= mn. 
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(ii) Let F — Q, a — 2e , and 6 = 24. Then a is algebraic over F of degree 
6 and 6 is algebraic over F of degree 4. We claim that F(a,b) = F( 2 12 ). Now 
6 = (2rs) 3 e F(2w) and a = (2^) 2 e F{ 2w). Thus, F(a,6) C F(2^). Now 
2^ = 24 6 = 2 4 (2 6 ) —1 6 F(a,b). Hence, F(a,b) = F(2vi). Since x 12 — 2 is 
the minimal polynomial of 2 12 , [F( 2 T 2 ) : F] = 12 24 = 4 • 6. 

0 Exercise 5 Consider the unique factorization domain F[t\, where F is a 
field and t is transcendental over F. Show that the polynomial x 2 +tx+t e 
F(t)[x\ is irreducible over F(t). Also, show that x 2 + tx + t £ F(x)[t] is 
irreducible over F(x). 

Solution: Now t / but t 2 / 1. Note t is prime in F[t]. Thus, x 2 + 
tx + t £ F(t)[x] is irreducible over F(t) by Eisenstein’s criterion. If we consider 
x 2 + tx + t as a polynomial in t over F(x), then x 2 + tx + t = (x + l)i + x 2 . It 
follows that Eisenstein’s criterion does not apply. However, since (x + 1 )t -j- x 2 
is of degree 1 in i, it is irreducible over F(x). 

Exercise 6 Let K[u, u] denote the polynomial ring in two algebraic indepen¬ 
dent indeterminates u, v over the field K. Let F denote the field of quo¬ 
tients K{u,v) of K[u,v], Prove that the polynomial x 2 + vx + u is irre¬ 
ducible over F. 


Solution: Suppose x 2 + vx + u is reducible over F. Then 


x 


2 


+ vx + u 



p(u,v) \ 

q(u,v)J 



9(u,v)J 




wherep(u, u), q(u, v), /(u, v), g(u, v ) £ K[u , v\. We may assume that p(u, v) and 
q(u,v) are relatively prime in K[u,v] and also f(u,v) and g(u,v) are relatively 
prime in K[u,v\. Now 


uq(u,v)g(u,v) = p(u, v)f(u, v). 


(21.3) 


Hence, g(u,v) divides p(u, v), p(u,v ) divides ug{u,v ), q(u,v) divides /(u, v)> 
and f(u,v) divides uq(u,v). Also, 


Consequently, 


P(u,v) 

q{u,v) 


+ 


f(u,v) 

g(u,v )' 


vq(u, v)g(u , v) = p(u, v)g(u, v ) + q(u, v)f(u , v). (21-4) 

Therefore, g(u,v ) divides q(u,v ) and q(u,v ) divides g(u,v). Thus, 

g(u,v) = kq(u,v) 

7^a^Lc#fto£uiaZ 
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for some k £ K. Hence, g(u,v) and p(u,v) are relatively prime. Similarly, 
q{u,v) and f(u,v) are relatively prime. Thus, p(u,v) divides u and f(u,v ) 
divides u by Eq. (21.3). Hence, 

either p(u, v ) = k\u or p(u,v) = Aq, (21.5) 

either f(u,v ) = k 2 U or f(u,v) = k 2 (21.6) 

for some Aq,A ?2 G K. Suppose that p(u,v) = k\u and f(u,v) — k 2 U. Then 
substituting into Eq. (21.4) we obtain 

vq(u, v)g(u, v ) — k\ug{u , v) + /c 2 uq(u, v). 

Thus, 

vq(u, v)g{u , v) — k\ukq{u , v) + k 2 uq(u , v). 

Hence, vg(u,v) = {k\k + Aq)u. However, this contradicts the algebraic inde¬ 
pendence of u, u over A7 Substituting the remaining possibilities in Eqs. (21.5) 
and (21.6) into Eq. (21.4), we also obtain a contradiction of the algebraic 
independence of u , v over K. Thus, x 2 + vx + u is irreducible over F. 

Exercise 7 Let F = K(x,y ), where K is a field and x,y are algebraically 

independent indeterminates over K. Show that F ^ K(x)K(y), where 

K(x)K{y) = {Ei(jf>i(x)/qi(x))(ui(y)/vi(y)) \pi(x),qi(x) eK{x], 
Ui(y),Vi(y) G K[y\,qi(x) ^ 0 ,Vi(y) / 0}. 

Solution: Now ^ £ K(x)K{y) else ^ - {Hi{fi{x)gi{y))/h(x)k{y) t 
after obtaining a common denominator. Thus, 

h(x)k(y) = (x + y)(%2(fi(x)gi(y)). 

i 

This implies that x + y divides h(x)k(y). Hence, x + y divides h(x) or k{y) since 
x+y is prime in the UFD K[x,y\, a contradiction of the algebraic independence 
of x, y over K. 

21.1.2 Exercises 

1 . Show that Q(\/3, — \/3) = Q(\/3). 

2. Let F/K be a field extension. Show that [F : K] = 1 if and only if 
F = K. 

3. Consider the field extension R/Q. 

(i) Show that 7r 2 is transcendental over Q. 

(ii) Show that y/E is transcendental over Q. 

~Pi mJic. 
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4. Consider the field extension R/Q. Show that 7r —3 is transcendental over 

Q. 

5. Consider the field extension R/Q. Show that it is transcendental over 

Q(V2). 

6 . Consider the field extension R/Q. Show that 7r + y/2 is transcendental 
over Q. 

7. Let F/K be a field extension such that [F : K] < oo. Let p(x ) be an 
irreducible polynomial in K[x\. Suppose p(c) = 0 for some cGh Prove 
that deg p(x) divides [F : K]. 

8 . Find. [Q(^5) : Q]. 

9. Show that Q(\/3 — \/5) = Q(\/3, y/F). Find [Q(\/3 — \/5) : Q]. 

10. Show that the polynomial rr 2 — 5 is irreducible over Q(\/2). 

11. Find the minimal polynomial of \J2 + \/5 over Q. 

12 . Let c = \/3. Show that Q(c) = Q(c 2 ). 

13. Find [Q(V2,V5) : Q(V^)], [Q^,^) : Q], a basis for Q(v / 2,v / 5)/ 
Q(\/2), and a basis for Q(\/2, \/5)/Q. 

14. Let F/K be a field extension and c E F be algebraic over K. Let f(x) € 
K[x\. Show that /(c) is algebraic over K. 

15. Prove that if [F : K] = p, p a prime, then F/K has no proper intermediate 
fields. 

16. Let L and M be intermediate fields of the field extension F/K. Suppose 
that [L : K] is a prime. Prove that either LC\M = K or LCM. 

17. Let F/K be a field extension, f{x) be a nonzero polynomial in K\x\, and 
c E F. If /(x) is algebraic over K , prove that c is algebraic over K. 

18. Let F/K be a field extension such that [F : K] = p, p a prime. Prove 
that if c € F, c K, then F = K(c). 

19. Let F/K be a field extension and a, b G F be algebraic over K. If a has 
degree m, over K and b ^ 0 has degree n over K , prove that the elements 
a 4- 6, ab, a ~ b , ab~ l have degree at most mn over K. 

20. Prove that y/2 + y/3, y/2 — y/3 have degree 4 over Q and that V2\/3, 
\/2/\/3 have degree 2 over Q. Find the minimal polynomials of these 
elements over Q. 

IPufic. 
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21. Let F/K be a field extension and R be a ring such that K C R C F. 
Prove that if every element of R is algebraic over K , then R is a field. 

22. Let F/K be a field extension and u,v £ F. 

(i) Prove that K(u, u + v) — K (ii, v). 

(ii) If u and u + v are algebraic over K, prove that [K(u,v) : K] is finite 
and v is algebraic over K. 

23. Answer the following statements true or false. If the statement is true, 
prove it. If it is false, give a counterexample. 

(i) Let F/K be a field extension and L be an intermediate field of F/K. 
Let V be a basis of F/L such that 1 G V and U be a basis of L/K such 
that 1 eU. Then U U V is linearly independent over K. 

(ii) Let F/K be a field extension and L be an intermediate field of F/K. 
Let V be a basis of F/L and U be a basis of L/K. Then U UP is a basis 
of F/K. 

(iii) Let F/K be a field extension and c,d G F. If K(c,d ) = K(c ), then 
d = /(c) for some polynomial f(x) G K[x\. 

21.2 Splitting Fields 

Here we give some results concerning the existence of field extensions which are 
generated by roots of polynomials. These results are basic to Galois theory. 

Consider the polynomial ring K[x\ over the field K. Let f(x) G K[x\. In the 
quotient ring K[x\/ (f(x)) , we let g(x) denote the coset g{x) + (f(x)) . Thus, 
if g(x) — Y/i=okiX l , then by the definition of addition and multiplication of 
cosets, we have that g(x) = J2?=o kiX 1 . 

Theorem 21.2.1 (Kronecker) Let K be a field. If f(x) is a nonconstant 
polynomial inK[x], then there exists a field extension F/K such that F contains 
a root of f(x). 

Proof. Since K[x\ is a unique factorization domain, there exist irreducible 
polynomials fi(x ),..., / n (x) G K[x\ such that f(x) = fi(x) ■ ■ • f n (x). Thus, a 
root of any fi(x), i — 1,2,... ,n, is a root of f(x). Hence, it suffices to prove the 
theorem for f(x) irreducible in K[x\. The ideal (f(x)) is maximal in K[x] and 
so F = K[x]f (f(x)) is a field. Let a be the natural homomorphism of K[x\ 
onto K\x\j (f(x)). Since KD(f(x)) = {0}, o maps K one-one into F. Thus, say, 
K C F, that is, we identify k G K with k in F. Hence, a{f(x)) — f(x) = f(x), 
where f(x ) = f(x) + (f(x)) and x = x + (f(x)) . Now a(f(x)) = 0 and so 
f(x) = 0. Therefore, x is a root of f(x). ■ 
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The field extension F/K in Theorem 21.2.1 has some interesting proper¬ 
ties. Consider the subring K[x\ of F. Then o^x z ) — YliLo^iX 1 for all 

Kx 1 C K[x] and so a maps K[x] onto K[x\. Since a also maps K[x] onto 
F, we have F = K[x\ — K(x). Thus, for /(x) irreducible in K[x\, we have 
by Theorem 21.1.14 that [F : K\ = n and {l,x,... ,x n-1 } is a basis of F/K, 
where n = deg /(x). 

Example 21.2.2 x 2 + l is irreducible in R[x]. Now C = R/ (x 2 + l) = R[x] = 
{a + bx \ a, b £ R} is a field, where x — x + (x 2 + 1). Since x 2 = —1, we m,ay 
call C the field of complex numbers. We may think ofx as i. 

Example 21.2.3 Consider the polynomial x 4 — 3 6 Q[x], By Eisenstein’s 
criterion, x 4 — 3 is irreducible in Q[x], Set X — x + (x 4 — 3) in the field 
Q[x]/ (x 4 — 3). Then 

Q[x]/ (x A — 3^ = Q(A) = {a + bX + cA 2 + dA 3 | a,b,c,d £ Q} 

and {1, A, A 2 , A 3 } is a, basis of Q(A) over Q. Let us multiply two elements of 
QW and determine the form, a + bX + cA 2 + dA 3 for their product. Consider 
(1 + A + A 3 ) and (1 + A 2 ). Then 

(1 + A + A 3 )(l + A 2 ) = 1 + A + A 2 + 2A 3 + A 5 . 

Now 

1 + x + x 2 + 2x 3 + x 5 = x(x 4 — 3) + 1 T 4x + x 2 + 2x 3 
using the division algorithm,. Thus, 

1 + A + A 2 + 2A 3 + A 5 = A(A 4 - 3) + 1 + 4A + A 2 + 2A 3 

= A-0 + l + 4A + A 2 + 2A 3 . 


Hence, 

(1 + A + A 3 )(l + A 2 ) = 1 + 4A + A 2 + 2A 3 . 

Let us find (1 -1- A + A 3 ) -1 . Since x 4 — 3 is irreducible over Q, the gcd of x 4 — 3 
and x 3 -f x + 1 is 1. Therefore, there exist s(x), t(x) £ Q[x] such that 

1 = s(x)(x 4 — 3) + t(x)(l + x + x 3 ). 


Thus, 

1 = s(A)(A 4 -3)+i(A)(l + A + A 3 ) 

1 = 0 +1 (A) (1 + A -j- A 3 ). 

Hence, i(A) = (1 + A + A 3 ) -1 . We have not really calculated t( A), however. To 
do this calculation, we must know the exact form, of s(x) andtix). The method 
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for finding s(x) and t{x ) is described below. Now by repeated use of the division 
algorithm,, we have 

x 4 — 3 — x(x 3 4 x + 1) + (— x 2 — x — 3) 

x 3 4 x + 1 = (—x 4 1)(— x 2 — x — 3) + (—x 4 4) 

— x 2 — x — 3 = (x + 5 )(—x 4 4) 4 (—23) 

—x 4 4 - (^x-4L)(-23) + 0. 

Thus, by back substitution, we obtain 

—23 = — x 2 — x — 3 — (x 4 5 )(—x + 4) 

—23 — —x 2 — x — 3 — (x 4 5)[x 3 4- x 4 1 — (—x 4 1)(— x 2 — x — 3)] 

= (— x 2 — 4x 4 6)(— x 2 — x — 3) — (x + 5)(x 3 4 x + 1) 

= (— x 2 — Ax L 6)[x 4 — 3 — x(x 3 + x + 1)] — [x + 5)(x 3 + x + 1) 

= (—x 2 — 4x + 6)(x 4 — 3) + (x 3 4- 4x 2 — lx — 5)(x 3 + x + 1). 


This implies that 

1 = — x 2 — 4x 4- 6 )(x 4 — 3) 4 - (—+ 4x 2 — lx — 5 )(x 3 4 x + 1). 

Lj O ZO 


Therefore, 


Consequently, 


1 3 4 2 7 5 

w 23 23 23 23 


(1 + a + a 3 4 = 4 + 4 a _4 a2 _^ A 3. 


Since X is a root of x 4 — 3 in Q(A), we know by Corollary 14.1.10 that x — A 
divides x 4 — 3 over Q(A). In fact, x 4 — 3 = (x — A)(x 3 4 Ax 2 4 A 2 x 4 A 3 ). We 
know there exists afield Q(A)(A 2 ), where A 2 is a root o/x 3 4 Ax 2 4 A 2 x 4 A 3 over 
Q(A) by Theorem, 21.2.1. Over the field Q(A)(A 2 ), x 3 4 Ax 2 4 A 2 x 4 A 3 factors 
into (x — \ 2 )q(x), where q(x) has degree 2. There exists a field Q(A)(A 2 )(A 3 ), 
where A 3 is a root of q{x), and over the field Q(A)(A 2 (A 3 ), q{x) factors into 
(x — As)(x — A 4 ). Thus, 


x 4 — 3 = (x — A)(x -- A 2 )(x — As)(x — A 4 ) 


over Q(A)(A 2 )(A 3 )(A 4 ). In this particular example, we can take A 2 = —A and 
so Q(A) = Q(A)(A 2 ). Hence, 

Q(A,A 2 ) A3,A 4 ) = Q(A,A3). 


Now over Q(A), 

x 4 — 3 = (x — A)(x 4 A)(x 2 4 A 2 ). 

Also, x 2 4 A 2 is irreducible over Q(A), a, fact we leave as an exercise. Thus, 
[Q(A) : Q] -4 and [Q(A)(A 3 ) : Q(A)j - 2 . Hence, [Q(A)(A 3 ) : Q] = 8 . 

“Pufuc. 
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Example 21.2.3 leads us to believe that given any polynomial f(x) in a 
polynomial ring K[x\ over a field K, there exists a field extension F/K such 
that f(x) factors completely into linear factors. This is indeed the case, as we 
will presently show. 

Definition 21.2.4 Let K be a field. A polynomial f{x) in K[x] is said to 
split over a field S D K if /(x) can be factored as a, product of linear factors 
in 5[a:]. A field S containing K is said to be a splitting field for /(x) over K 
if fix) splits over S, but over no proper intermediate field of S/K. 

Example 21.2.5 The field of complex numbers C is a splitting field for the 
polynomial x 2 +1 over R. This follows since x 2 +1 = (x + i)(x — i) inC[x\ and 
C/R has no proper intermediate fields because [C : R] = 2. (// C D L D R, 
where L is an intermediate field of C/R, then 2 = [C : L][L : R] and so either 
[C : L\ = 1 or [L : R] = 1. Thus, either C = L or L = R.) Note that C is not 
the splitting field of x 2 + 1 over Q since x 2 + 1 splits over Q W C C. 

Theorem 21.2.6 Let K be a field and f(x) be a polynomial in K[x\ of degree 
n. Let F/K be a field, extension. If 

f{x) = c(x - ci)(x - c 2 ) • • • (x - c n ) in F[x], 

then K(c 1 ,C 2 ,..., c n ) is a splitting fi.eld for f(x) over K. 

Proof. Since cj, C 2 ,.. ■, c n are the roots of f(x), fix) splits over K{c\, C 2 ,..., 
Cn). Let L be an intermediate field of K[c\, C 2 ,..., c n )/K such that f(x) splits 
over L. Since K[x] is a UFD, there is only one way /(x) can split over L, 
namely, f{x) = c{x — c\)(x — C 2 ) ♦ ■ ■ (x — c n ). Thus, ci, C 2 ,..., c n € L, whence 
L D K(c\, C 2 ,.. -, Cn). Hence, K(c\, C 2 , ..., c n ) is the smallest intermediate field 
over which f{x) splits. ■ 

The field Q(A, A3) of Example 21.2.3 is a splitting field for x 4 — 3 over Q. 
We now prove the existence of splitting fields. 

Theorem 21.2.7 Let K be a field and f(x) be a nonconstant polynomial over 
K. Then there is a splitting fi.eld for f(x) over K. 


Proof. If deg/(x) = 1 , then K is a splitting field for f{x) over K. Assume 
the theorem is true for all polynomials of degree n—1 (> 1). Suppose deg f(x) — 
n. There exists a field K\ D K such that K\ contains a root c\ of fix) by 
Theorem 21.2.1. Thus, fix) = (x — ci)fi(x) in K\ [x] and deg/i(x) — n — 1. 
By the induction hypothesis, there exists a field extension E/Kj such that 
/i(x) splits in E[x\. Thus, /(x) splits in E[x\, say, 


/(x) = c(x - cj)(x - c 2 ) * * • (x - Cn). 

7feaiAe»tafLca/ “PhySic-A. 
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By Theorem 21.2.6, the intermediate field K(c\, C 2 , ..., c n ) of E/K is a split¬ 
ting field for f(x) over K. ■ 

The intermediate field Q(\/3, i\f?>) of C/Q is a splitting field for z 4 — 3 
over Q. The field Q(A, A 3 ) of Example 21.2.3 is also a splitting field for — 3 
over Q. However, we cannot conclude that Q(v ,/ 3, i\/T) — Q(A, A 3 ). Hence, 
splitting fields for a given polynomial over a field are not unique. We will show, 
however, that they are unique up to isomorphism. 

Theorem 21.2.8 Let a be an isomorphism, of the field. K onto the fi,eld K'. 
Let p(x) = fco + k\x + k 2 X 2 + ■ • • + k n x n be an irreducible polynomial in K[x] 
of degree n, c be a root of p(x) in some fi,eld extension of K, a,nd p'{y) = 
a(kd) + a (ki)y + a(k 2 )y 2 + • • • + a(k n )y n be the corresponding polynomial in 
K'[y\. Then p'(y) is irreducible in K'[y\. If d is a root of p'(y) in som,e field, 
extension of K', then a can be extended to a.n isomorphism, a' of K{c) onto 
K'(d) with ol (c) = d. a' is the only extension of a such that a'(c ) = d. 

Proof. By an argument similar to the one used in the proof of Theorem 
14.1.14, a can be uniquely extended to an isomorphism a of K[x\ onto K'[y\ 
so that for every polynomial 6 q + b\x + +-b & m £ m £ K[x\, 

a(b 0 + b\x + b 2 x 2 H-h b^x™) = a(b 0 ) + a(b\)y + a(b 2 )y 2 H-h a( 6 m )j/ m . 

We leave to the reader the verification that p'{y ) is irreducible in K'[y\. 
Let (3 be the natural homomorphisms of K[x] onto K[x\/ (p(x)) and (3 r be the 
natural homomorphisms of K'[y] onto K'[y\/ ( p'{y )) . Then Ker (3 = Ker (3' o a. 
Hence, there exists an isomorphism a* of K[x\/ (p(x)) onto K'[y}/ (p'(y )) such 
that (3' o a = a* o (3. By Theorem 21.1.11 and Corollary 21.1.12, there exist iso¬ 
morphisms 7 and 7 ' of K[x\f (p(x)) onto K{d) and K'[y]/ { p'{y )) onto K'(d), 
respectively. Thus, a' is the map 7' o a* o 7 -1 . The situation is described by 
the following diagram: 


K[x\ — 

a 

— ~K'[y\ 

1/3 


1/3' 

Kfx) 

a* 

, K'M 

(P(®)) 


<p'(y)> 

P 

rj 

W 

K(c) — 

Q 

— ^K'(c') 


Let a" be any other extension of a to an isomorphism of K(c ) onto K'{d ) 
such that a"(c ) — d . Now {1, c,..., c n_1 } is a basis for K{c)/K and {1, d ,..., 
d n ~ 1 } is a basis for K'(d)/K'. We have that 

a"(Y kiC ^ = a ''(h)a''{d) = Y OL{ki)d l = a'QT k t d). 

i=0 i —0 i =0 i =0 
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Hence, a" — ex'. ■ 

Corollary 21.2.9 Let E/K be a field extension and p{x) be an irreducible 
polynomial in K[x\. If a, b G E are roots of p(x ), then K(a) ~ K(b). 

Proof. Let K = K' and a be the identity map. ■ 

From Corollary 21.2.9, we have Q(v / 3) ~ Q(iy/3) in Example 21.2.3. 

Theorem 21.2.10 Let a be an isomorphism, from, the fi.eld K onto the field 
K’ . Let 

fix) = ko + k\x + k 2 x 2 H-h k n x n 

be a polynomial in K[x\ and 

f(y) = a(ko) + Oi(ki)y + cx(k 2 )y 2 4-h oi(k n )y n 

be the corresponding polynomial in K r [y\. 

If S is a splitting field for f(x) over K and S' is a splitting field for f(y) 
over K ', then a can be extended to an isomorphism a' of S onto S'. 

Proof. The proof is by induction on deg f{x). If deg/(x) = 1, then K = S 
and K’ = < S'. In this case, we can take a' — a. Assume the theorem is true 
for all polynomials of degree less than n (the induction hypothesis). Suppose 
deg f(x) = n. Extend a to an isomorphism a of K[x\ onto K'[y\ as in Theorem 
21.2.8. Let p(x) be an irreducible factor of f{x) and c\ G S' be a root of 
p{x). Let cf x G S' be a root of a{p(x)) = p(y). Then by Theorem 21.2.8, a 
can be extended to an isomorphism ai of K{c\) onto KfiSfi). Extend a\ to 
an isomorphism of of K(c\)[x\ onto R’ic^fiy]. Now f{x) — {x — ci)fi{x) in 
K(ci)[x] and f{y) = (y-c[)f{(y) in AT(ci)[y], where f[(y) = af(/i(x)). Clearly 
S is a splitting field for fi(x) over K{c\) and S' is a splitting field for f[(y) 
over K (c 1 ). Since deg/i(x) = n — 1 = deg f[{y), ol i can be extended to an 
isomorphism of S onto S' by the induction hypothesis. I 

Corollary 21.2.11 Let fix) £ K[x\. Any two splitting fields for f(x) over K 
are isomorphic. 

Proof. Let S and S' be two splitting fields for f{x) over K. In Theorem 
21.2.10, take K = K' and a the identity mapping on K. ■ 

Definition 21.2.12 Let F/K be a field extension and a, b G F. Then a and b 
are called conjugates if a a,nd b are roots of the sam,e irreducible polynomial 
over K. 
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We ask the reader to prove that the notion of conjugates defines an equiv¬ 
alence relation on F. 

Example 21.2.13 Consider the fi.eld extension C/R. Let a, b £ R. Then a+ 
bi and its complex conjugate a — bi are conjugates in the sense of Definition 
21.2.12. This is obvious if b = 0. Suppose 6/0. Then a + bi R. Let 
f(x) = x 2 + 2 ax + (a 2 + 6 2 ). Since a + bi <£ R, [R(a + bi) : R] = 2. Now a + bi 
is a root of f{x) and f(x) must be irreducible over R. a — bi is also a, root of 

/(z)- 

In certain cases, the following theorem is useful in determining ^the irre- 
ducibility of a polynomial. ;r 

Theorem 21.2.14 Let F be a field. Let p be a prime in Z and a £ F. Then 
the polynomial x p — a is reducible over F if and only if x p — a has a root in F. 

Proof. Suppose f(x) ~ x p — a £ F[x] is reducible. Let f(x) — g{x)h{x) for 
some g(x), h(x) £ F[x], degg(x) = m, 0 < m < p, and 0 < degh(x) < p. Since 
/(x) is monic, we can take g{x) to be monic. By factoring g{x) as a product of 
linear factors in a splitting field of g{x) over F , we see that the constant term 
of g{x) is (—1 ) m d for some d £ F. Since gcd(ra,p) = 1 , there exist integers s 
and t such that 1 — sm + tp. By Theorem 21.2.1, there is a field extension of 
F which contains a root of fix)- Let 6 be such a root of f{x). 

Case 1: Suppose the characteristic of F is p. Since 6 is a root of /(x), 
bP — a. Thus, 

(x — b) p = x p — bP = x p — a 

and all the roots of f(x) equal 6 . Now every root of g{x) is also a root of f{x). 
Thus, all the m roots of g{x) are equal to 6 . Hence, 6 m = d. Now 

d s = h ms = b i- P t = bb - P t = ba -t 

Hence, 6 = d s a 1 £ F and so /(x) has a root in F. 

Case 2: Suppose that F has characteristic 0. Let c be any other root of 
f(x). Then 

(f — a — bP. 

Hence, c = 6 u, where u = c -p+ifrp-i an d u p — 1. From this, it follows that the 
roots of f{x) are of the form 


6 , bu \,..., bup —\, 

where u? = 1. As in case 1, we have that the product of the roots of g(x) is 


d — b m Ul u 2 • ■ • u m ~i = 6 Tn u, 

“Pi Maf4a##ia£tea/ 
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where v p = 1. Now 1 = sm + tp implies that 

b sm = v ~ s d s = b l ~ tp = ba~ l . 
Therefore, b = v~ s d s a t . It then follows that 

a = ip = {v^d^y = v - sp {d s ay = (, d s a y. 

Thus, d s af £ F is a root of /(x). 

The converse follows from Corollary 14.1.10. ■ 


21.2.1 Worked-Out Exercises 

0 Exercise 1 Find a splitting field S' of x 4 — 10x 2 + 21 over Q. Find [S : Q] 
and a basis for S/Q. 

Solution: Note that x 4 — 10x 2 + 21 = ( x 2 — 3)(x 2 — 7) over Q. Therefore, a 
splitting field S of x 4 — 10 x 2 + 21 over Q is Q(v / 3, \/7). Hence, [S : Q] = 4 and 
{ 1 , \/3, V7, \/21} is a basis for S/Q, as can be seen from Worked-Out Exercise 
2 (page 455). 

0 Exercise 2 Show that the splitting field of x p — 1 over Q is of degree p — 1, 
where p is a prime. 


Solution: Let f(x) — x p — 1 € Q[a:j. Now f(x) = (x — 1 )g(x), where 
g{x) = x p ~ l + x p ~ 2 + • • ■ + x + 1 . Also, 


9(x) 


x p -l 
x — 1 


Hence, 


g{x + 1 ) 


(x + 1 ) ? 


= x p " 


x 


+ 


IX P ~ 2 + 


+ 



Now since p is prime, p |( p ) for all 1 < r < p~ 1. Also, p 2 does not divide (p^)- 
Therefore, by Eisenstein’s criterion, g{x + 1) is irreducible over Q. Thus, g(x) 

27 ri 

is irreducible over Q. Let £ = e p , where = — 1 . Then the roots of fix) 
are 1,£,£ 2 ,... ,£ p_1 and the roots of g(x) are £,£ 2 ,... ,£ p-1 . Now the splitting 
field of f[x) is S = Q(l,£, £ 2 ,.-.,£ p_1 ) = Q(£). Also, p(x) is the minimal 
polynomial of £ over Q. Hence, [S : Q] —p — 1 . 


0 Exercise 3 Find the splitting field of the following polynomials over Q. 

(i) x 4 + 1 . 

(ii) x 6 + x 3 + 1 . 
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Solution: (i) Let /(x) = x 4 +1 . Then f(x) = (x 2 + y/2x 4 - 1) (a: 2 — \[2x +1) 
over Q(\/2). Therefore, the roots of /(x) are 

—\/2 d= i\J 2 \/2 i i\/2 
2 ’ 2 ' 

Let 5 be the splitting field of /(x) over Q. We claim that S = Q(\/2,i). Now 

V2 = —— ^ + V2--n/2 
2 2 

and 

/^. y/2 — iy /2 

V2* =-2-2- 6 5 - 

This implies that i = € 5. It now follows that Q(\/2, i) C 5. Clearly S C 

Q(V2 ,i). Consequently, 5 = Q(\/2, i). Now x 2 — 2 is the minimal polynomial 
of over Q and x 2 +1 is the minimal polynomial of i over Q. In fact, x 2 + 1 is 
the minimal polynomial of i over Q(\/2). Thus, [S : Q ] = [S : Q(v / 2)][Q(v / 2) : 
Q] = 2 ■ 2 = 4. 

(ii) Let /(x) = x 6 + x 3 + 1. Now (x 9 — 1) = (x 3 — l)(x 6 + x 3 + 1). The 
roots of (x 9 — 1) are 1,£, £ 2 ,... , £ 8 and l,£ 3 ,£ 6 are the roots of (x 3 — 1), where 
£ = e 2 ^. Hence, £, £ 2 , £ 4 , £ 5 , £ 7 , £ 8 are the roots of x 6 + x 3 + 1. Therefore, 
S = Q(£, £ 2 , £ 4 > £ 5 , £ 7 , £ 8 ) = Q(£) is the splitting field of x 6 + x 3 + 1 over Q. 
Since x 6 + x 3 + 1 is irreducible over Q, [S : Q] —6. 


21.2.2 Exercises 

1. Prove that the polynomial p'(y) in Theorem 21.2.8 is irreducible in K'[y\. 

2. Let F/K be an algebraic field extension. Define ~ on F by for all a, 
b 4 F, a ~ b if and only if a and b are conjugates. Prove that ~ is an 
equivalence relation. 

3. (i) Show that the polynomials x 2 — 2x — 1 and x 2 — 2 have the same 
splitting field over Q. 

(ii) Find a pair of polynomials in Q[x], other than the pair given in (i), 
which have the same splitting field over Q. 


4. Find a splitting field S of the polynomial x 3 — 3 over Q. Find [S : Q] and 
a basis for S/ Q. 

5. Find a splitting field S of the polynomial x 2 +x+[l] over Z 5 . Find [5 : Z 5 ] 
and a basis for S/Z 5. 


6 . 


Find a splitting field S of the polynomial x 2 + [1] over Z2. Find [S : Z2] 
and a basis for S/Z 2 . 
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7. Find a splitting field S of the polynomial x 4 - 7x 2 + 10 over Q. Find 
[S: Q] and a basis for S/ Q. 

8 . Prove that Q( — \ + ^fi) is a splitting field of the polynomial x 4 + x 2 + 1 
over Q. Find [Q(—^ + ^i) : Q]. 

9. Let f(x) £ K[x], a polynomial ring over the field K. Let S be a splitting 
field for f{x) over K. Prove that for any field L, S D L D K, S is a 
splitting field of f(x) over L. 

10. Let f{x) } g{x)j and h(x) £ K[x], a polynomial ring over the field K. 
Suppose that S is a splitting field of f(x) over K and f{x) = g(x)h(x). 
Prove that S contains a splitting field of g[x) over K. 

11 . Let f{x),g{x) £ K[x] y a polynomial ring over the field K. Suppose that 
g(x) = f(ax + b ), where 0 ^ a, b £ K. Prove that f{x) and g{x) have 
equal splitting fields over K. 

12 . Prove that if f{x) is a polynomial in K[x] of degree n, then [S : K] < n!, 
where S is a splitting field of f(x) over K. 

13. Let AT be a field and fi{x ), /2 (x),..., / n (x) G K[x\ be such that deg fi{x) 
> 1, 1 < i < n. Show that there exists a field extension F/K such that 
each fi(x) has a root in F. 

14. Let A be a field of prime characteristic p and a £ F. Prove that x p — x — a 
is reducible over F if and only if x p — x — a has a root in F. 

15. Answer the following statements, true or false. If the statement is true, 
prove it. If it is false, give a counter example. 

(i) Let f[x) be an irreducible polynomial of degree n over a field K of 
characteristic 0. Let S = A"(ci, C2 ,..., c n ) be a splitting field of fix) over 
K , where ci, C2 ,... ,c n are the roots of fix). Then AT(c2 ,..., c n ) C S. 

(ii) The polynomial fix) = x 5 — x — 30 is reducible over Q. 

(iii) C is a splitting field of some polynomial over Q. 


21.3 Algebraically Closed Fields 

The most important result in Steinitz’s work in 1910 was his proof of the 
existence and uniqueness of an algebraic closure of a field. In this section 1 , we 
present these results. 

1 This section may be skipped without any discontinuity. The only place this section is 
needed is in Exercise 4 (Section 24.1). 
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Definition 21.3.1 A field K is called algebraically closed if for all f(x) £ 

K[x\ with deg f(x) > 1, f{x) has a root in K. 

Theorem 21.3.2 Let K be afield. The following conditions are equivalent. 

(i) K is algebraically closed. 

(ii) Every irreducible polynomial in K[x\ is of degree 1. 

(lii) Let f(x) £ K[x\, degf(x) > 1. Then f(x) splits as a product of linear 
factors over K. 

(iv) If F/K is an algebraic field extension, then F = K. 

Proof. (i)=^(ii) Let p(x) £ K[x\ and p{x) be irreducible. By (i), there exists 
a £ K such that p{a) — 0. Then p(x) = (x — a)g(x) for some g(x) £ K[x\. 

Since p{x) is irreducible, g{x) £ if. Hence, degp(x) = 1. 

(ii) =>(iii) Let f(x) £ K[x ] and degf(x) > 1 . Let f(x) = pi(x) ■ • -p s (x), 
where pfix) £ K[x} is irreducible, 1 < i < s. Then d egpi(x) = 1, 1 < i < s. 

We may write pfix) = kfix — a^), where hi, a* E K, 1 < i < s. Let k = k\ ■ * • k s . 

Then f(x) = k(x — ai ) • ■ • (x — a s ). Thus, f(x) splits as a product of linear 
factors over K. 

(iii) =>(iv) Let F/K be an algebraic field extension. Let c £ F and let 
p(x) £ K[x] be the minimal polynomial of c over K. Since p(x) is irreducible, 
degp(z) = 1 by (iii). Therefore, p(x) — ax-\~b £ K[x]. Since p(c) = 0, ac-\-b = 0. 

Thus, c = — a~ l b £ K. Hence, K = F. 

(iv) =^(i) Let f(x) £ K[x], deg f(x) > 1. There exists a field extension F/K 
such that F has a root of f(x), say, a. Then K(a)/K is an algebraic field ex¬ 
tension. Therefore, K{a ) = K and so a £ if. Thus, K is algebraically closed. ■ 

We now prove the existence of an algebraically closed field. The following 
proof is due to Artin. 

Theorem 21.3.3 Let K be a field. Then there exists an algebraically closed 
fi,eld F such that K is a, subfield of F. 

Proof. We first construct an extension F\/K such that if f{x) £ K[x] and 
deg/(x) > 1, then f(x) has a root in F\. Let KL be the set of all polynomials 
in K[x\ of degree > 1. Let 5 be a set which is in one-one correspondence with 
K,. For f(x) £ /C, let Xf be the corresponding element in S. 

Consider the polynomial ring K[S]. Let I be the ideal of K[S] generated 
by all polynomials f(xf) in K [S]. We claim that I 7^ iff.?]. Suppose that 
I = if" [S']. Then there exists gi £ if [S'] such that 

01/1(271) + 92 f 2 {x h ) + • • • + 9nfn{x fn ) = 1. (21.7) 
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Write Xi = Xf i7 1 < i < n. Since the polynomials gi, 1 < i < n, involve only 
a finite number of indeterminates, say, xi,x 2 , .. ■, x m , with m > n, we may 
write Eq. (21.7) as 

n 

- - .,Xm)fi(Xi) = 1. (21.8) 

2—1 

By Exercise 13 (page 468), there exists a finite extension L/K such that each 
polynomial fi, 1 < i < n, has a root in L. Let c* be a root of fi in L, 1 < i < n. 
Let ci = 0 for n < i < m. Substituting c* for Xi, 1 < i < n, in Eq. (21.8), we 
get 0 — 1, a contradiction. Hence. I ^ if[S]. 

Let M be a maximal ideal of if [S'] such that I C M. Let F\ = K[S]/M. 
Then F\ is a field containing an isomorphic copy (if + M)/M of if. Thus, F\ 
can be regarded as a field extension of if. Also, if / E if[x] and deg f{x) > 1, 
then Xf + M is a. root of / in F\. 

By induction, we can form a chain of fields 


Ei C F 2 C • • • C F n C • • • 


such that every polynomial of degree > 1 in F n has a root in F n+ \. Let F = 
U^iF„. Then F is a field. Let / E F[x], Then / E F n [x] for some positive 
integer n. Thus / has a root in F n+ \ C F. Hence, F is algebraically closed. ■ 

Corollary 21.3.4 Let if be a field. Then there exists an algebraic field exten¬ 
sion FjK such that F is algebraically closed. 


Proof. By Theorem 21.3.3, there exists a field extension E/K such that E 
is algebraically closed. Let F — {a E E | a is algebraic over if}. Then F/if 
is an algebraic extension. Let f(x) E F[x\ and deg/(x) > 1. Then fix) has a 
root c in E. Thus, c is algebraic over F. Since F/if is an algebraic extension, 
c is algebraic over if. Hence, c E F and so F is algebraically closed. ■ 

Definition 21.3.5 Let K be a field. A field F D if is called an algebraic 
closure of if if 

(i) F/K is algebraic and 

(ii) F is algebraically closed. 

For any field if, Corollary 21.3.4 guarantees the existence of an algebraic 
closure of if. 


Lemma 21.3.6 Let F and L be fields with L algebraically closed. Let a : F —> 
L be an isomorphism, of F into L. Let a be an algebraic element over F in some 
fi,eld extension of F. Let f{x) E F[x\ be the minim,a,l polynomial of a. Then a 
can be extended to an isomorphism, p of F(a) into L and the num,ber of such 
extensions is equal to the number of distinct roots of f{x). 
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Proof. Let f(x) = ao+aiz-j-- • -Toni 71 6 Fjx] and f a (x) = cr(ao)+ <j(a±)x J r 

• • • + a(a n )x n G L[x\. Since L is algebraically closed there exists a root b of 
f a (x) in L. Since a is algebraic over F, F(a ) = F[a] by Corollary 21.1.12. Thus, 
if u G F(a ), then u = cq + cia + • • • + c k a k G F[a], Define rj : F(a) —* L by 

r)(c 0 + H-h c fc a fe ) = cr(c 0 ) + <r(ci )6 H-h cr(c k )b k 

for all Co + Cia H-h c k a k G F(a). Suppose cq + cia + - • ■ + c k a k — do + Gfia + 

• • • + d s a s . Let j(x) = co + c\x + • ■ - + c k x k and 7 / (x) — do + d\x + ■ • • + d s x s . 

Then (7 — 7') (a) = 0. Hence, f(x) divides (7 — Thus, f a (x) divides 

(7°" — y a )(x ). Consequently, (7^ — j ,<7 )(b) = 0 and so <j(co) + cr(ci)6 + • - • + 
<j(c k )b k = cr(do) + cr{d\)b + • - - + a(d s )b s . Thus, 7 is well defined. Clearly 77 is 
an isomorphism. The number of distinct roots of f(x) in the algebraic closure 
of F is equal to the number of distinct roots of f a (x) in L. For any extension 
£ : F(a) —> L,£(a) is a root of f CJ (x). Therefore, the number of such extensions 
is equal to the number of distinct roots of f(x). ■ 

We close this section by showing that the algebraic closure of a field is uni¬ 
que up to isomorphism. Our proof uses Zorn’s lemma while Steinitz’s original 
proof used the equivalent concept of the axiom of choice. 

Theorem 21.3.7 Let F/K be a,n algebraic fi.eld extension. Let L be a.n alge¬ 
braically closed, field, and a be an isomorphism of K into L. Then there exists 
a,n isomorphism, 7 of F into L such that rj\x = (J. 

Proof. Let S = {(2?, A) | E is a subfield of F, K C E and A : F —*• L 
is an isomorphism such that \\k = a}. Since (K,cr) G S, S <f>. Let (E, A), 
(E',\') G S. Define a relation < on S by (E, A) < (E',X) if E C E' and 
A^e = A. Then (S,<) is a poset. Let {(Ei, A z )}^a be a chain in S. Let 
E = Ui G A-EC Then E is a field and K C E. Define A : E —» L as follows: 
Let a G E. Then a G E n for some n. Define A(a) = A n (a). Since {(Ei, A;)}; £ a 
is a chain, A is an isomorphism of E into L. Hence, ( E , A) G S and (E, A) is 
an upper bound of {(Ei, A^)}^^. Hence, by Zorn’s lemma, S has a maximal 
element, say, (T, 7 ). Suppose T F. Let a G F\T. By Lemma 21.3.6, there 
exists an isomorphism (3 : T(a) —» L such that ( 3 \t — 77. From this, it follows 
that (T(a),f 3 ) G S, a contradiction of the maximality of (T, 77). Thus, F — T. ■ 

Theorem 21.3.8 Let K be a, fi,eld,. Let F and F' be two algebraic closures of 
K . Then there exists a,n isomorphism, A of F onto F' such that A (a) = a for 
all a G K. 


Proof. Let a : K —■» F' be such that a (a) = a for all a G K. Then a is 
an isomorphism of K into F'. By Theorem 21.3.7, there exists an isomorphism 
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A : F —+ F' such that A |k = Now A (F) ~ F. Thus, A (F) is algebraically 
closed and K C A (F). Now K C A (F) C F'. Since F'/K is algebraic, F'/\(F) 
is algebraic- Thus, F' = A (F). Hence, F ~ F'. ■ 

21.3.1 Exercises 

1. If F is a field with a finite number of elements, prove that F is not 
algebraically closed. 
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Ernst Steinitz (1871-1928) was born 
on June 13, 1871, in Laurahiitte, Silesia, 

Germany. In 1890, he started his studies 
in mathematics at the University of Bres¬ 
lau (now Worclaw, Poland). In 1894, he re¬ 
ceived his Ph.D. He started teaching at the 
Technical College in Berlin-Charlottenberg. 

In 1920, he was appointed professor at the 
University of Kiel, where he remained until 
his death. He died on September 29, 1928. 

In 1910, he published “Algebraische The- 
orie der Korper" in which he gave an ab¬ 
stract definition of a “field.” He also intro¬ 
duced the notion of a prime field, separable element, perfect field, and degree of tran¬ 
scendence of an extension. With the help of the axiom of choice, he proved that for any 
field K there exists a field extension F/K such that every polynomial over K splits 
into linear factors over F and the smallest such field is unique up to isomorphism. 
He called such field, algebraically closed. His work on field theory was influenced by 
Weber and Kronecker. 

Steinitz also worked on the theory of polyhedra. 






Chapter 22 


Multiplicity of Roots 


22.1 Multiplicity of Roots 

In some cases, an irreducible polynomial p(x) of degree n over a field K does 
not have n distinct roots in a splitting field of p(x) over K. In this chapter, we 
examine this situation. 

If fix) is a polynomial over K and c is a root of fix) in some field F 
containing K , then the multiplicity of c is the largest positive integer m such 
that (x — c) m divides f(x) over F. 

Definition 22.1.1 Let K be a field and p(x) be an irreducible polynomial in 
K[x ] of degree n. Then p{x) is called separable if it has n distinct roots in a 
splitting field S of p{x) over K\ otherwise p(x) is called inseparable over K. 
An arbitrary polynomial in K[x} is called separable if each of its irreducible 
factors in K[x\ is separable; otherwise it is called inseparable. 

Definition 22.1.2 Let F/K be a. field extension and c be an element of F 
which is algebraic over K. Then c is called separable (or separable alge¬ 
braic) over K if its minimal polynomial over K is separable; otherwise c is 
called inseparable over K. If F/K is an algebraic extension, then F/K is 
called separable (or separable algebraic) if every element of F is separable 
over K] otherwise F/K is called inseparable. 

Let F/K be a field extension and L be an intermediate field of F/K. Let 
ce F and suppose c is separable over K. Then c must be separable over L. 
This follows since if f{x) and p(x) are the minimal polynomials of c over K 
and L, respectively, then p(x)\f(x). Hence, c cannot be a multiple root of p{x) 
since it is not one of f(x). 

Example 22.1.3 Consider the field K(t ), where K is a field of prim,e charac¬ 
teristic p and t is transcendental over K. It follows that the polynomial x p —t p is 
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irreducible over K(t p ) by Eisenstein’s criterion since t p is irreducible in K[t p ]. 
Now x p — t p factors into 

(x — t)(x — t) ■ ■ ■ {x — t)= ( x — t) p 

" -V- / 

p times 

over K(t). Thus, K(t ) is a splitting field for x p — t p over K(t p ) and we see that 
x p — t p has only one root in K(t ), namely, t. (Since t ^ K(tT), we can also 
use Theorem 21.2.14 to deduce that x p — t p is irreducible over K(t p ).) Thus, 
x p — t p , t, and K(t) are inseparable over K{t p ). Note that t has multiplicity p 
over K{t p ). 

Let K be a field and 


f(x) = ko 4- k\x + • • • + k n x n 

be a polynomial in K[x]. Then by the formal derivative, f(x ), of f{x) we 
mean the polynomial 

f'(x) = k\ + • • • + ikix 1 ^ 1 + • • ■ + nk n x n ~ l G K[x\. 

Let K be a field and f(x), g(x) G K[x\. The following properties of formal 
derivatives are easily verified: 

(f(x)+g(x)) r = f(x)+g'(x), 

(f(x)g(x))' = f{x)g'{x) + f'(x)g(x), 

(kf(x)Y = kf'(x) for all k G K 

and if f{x) = x, then f{x) = 1. 

Theorem 22.1.4 Let K be a fi,eld and f(x) G K[x], f(x) 0. Let a be a root 
of f(x) in some extension field F of K. Then a is a multiple root of f{x) if 
and only if f'(a) = 0. 


Proof. Suppose a is a multiple root of f(x). Then ( x — a) 2 divides f(x). 
Hence, 

f(x) = (x - a) 2 g(x) 

for some g(x) G F[x\. Now f'(x) = (x — u){(2; — a)g'(x) + 2g{x)}. Therefore, 
f'(a) = 0. Conversely, suppose f'(a) = 0. Then deg f(x) > 2. By the division 
algorithm, 

f(x) — (x - a) 2 q(x ) + h(x) 

for some q(x),h(x) G F[x], where either h(x) = 0 or deg h(x) < 1. Suppose 
h(x) -=fi 0. Since f(a ) = 0, h(a) — 0. Thus, deg/i(x) = 1 and a is a root of h{x). 
Hence, h(x) = b{x — a) for some 0 7 - b £ K. This implies that 



x — a) 
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and so 

f'{x) = (x - a){(z - a)q(x) + 2 q(x)} + b 


Therefore, 


0 = f'(a) = 6, 


a contradiction. Hence, h{x) = 0 and so f{x) = (x — a) 2 q{x). Consequently, a 
is a multiple root of /( x). ■ 


Theorem 22.1.5 For any field K, an irreducible polynomial p{x) in K[x\ is 
separable if and only ifp(x ) and its form,al derivative p'(x) are relatively prime. 


Proof. Let d(x) denote the gcd of p{x) and p'{x). Suppose p{x) is separable. 
Let c be a root of p{x) in some field containing K. Then p(x) = (x — c)f(x) 
for some f(x) € K(c)[x\. Since p{x) is irreducible, /(c) ^ 0. Now p'(x) = 
f(x) + (x — c)f'(x) and so p'(c) = /(c) + 0^0. Hence, c is not a root of d(x). 
But every root of d(x) must be a root of p(x) since d(x)\p(x). Thus, since we 
have just seen that d(x) and p(x) have no common roots, d(x) has no roots. 
Therefore, d(x) = 1. 

Conversely, suppose that d(x) = 1. Let c be any root of p(x). Let m denote 
the multiplicity of c. Then 

p(x) = (x - c) m f(x ) 

over K(c ) and c is not a root of f(x). Now 

p'(x) — m{x — c) m_1 f{x) + (x — c) m f{x) 

= {x - c) 7n ~ 1 [mf(x) + (x - c)f'(x)}. 

Thus, (x — c) 771 ' 1 is a common divisor of p'(x ) and p{x). Hence, 

{x — c) 771-1 \d(x). 

Since d(x) = 1 , m = 1 . Consequently, p(x) has no repeated roots. ■ 

Theorem 22.1.6 For a,ny field K , an irreducible polynomial p(x) in K[x\ is 
separable if and only if p’(x) 0. 


Proof. Let d{x) denote the gcd of p(x) and p'{x). Suppose pix) is separable. 
If p'{x) = 0, then d(x) = p(x) 1, a contradiction of Theorem 22.1.5. Con¬ 
versely, suppose p'(x) 0. Since p(x) is irreducible, the only common divisors 

of p(x) and p'(x) are 1 and pix). Since 1 < degp'(x) < degp(x), 1 is the only 
common divisor of p'{x) and p(x). Hence, d{x) = 1. Thus, p(x) is separable by 
Theorem 22.1.5. ■ 


Corollary 22.1.7 Let K be a field of characteristic 0. Then every nonconstant 
polynomial in K[x] is separable. 



22.1. MULTIPLICITY OF ROOTS 


477 


Proof. Let f(x) be any nonconstant polynomial in K[x\ and p(x) = ko 
+k\x + k 2 X 2 + • • • + k n x n be any irreducible factor of f(x)> where n > 1. Then 
there exists i > 0 such that ki ^ 0. Hence, ik z 0 since K has characteristic 
0. Thus, p'(x) ^ 0 and so p{x) is separable by Theorem 22.1.6. Hence, f{x) is 
separable. ■ 

Example 22.1.8 Consider the irreducible polynomial p(x) = x p — t p over 
K(t p ) of Example 22.1.3. Then p'(x) = px p ~ l = 0. Thus, x p — £ p is insepa¬ 
rable over K(tP). 

Theorem 22.1.9 Let K be a field of characteristic p > 0. Then an irreducible 
polynomial p(x) = ko + k\x 4- k 2 X 2 + • • • T k n x n over K is inseparable if and, 
only if p{x) = q{x p ) for som,e q{x p ) £ K[x p ]. 

Proof. Clearly p'M = o if and only if iki = 0 for all i = 1,2,..., n. Thus, 
p'(x) = 0 if and only if p\i for those i such that ki ^ 0. Hence, p'{x) = 0 if and 
only if p{x) = q(x p ) for some q(x p ) £ K[x p ). The conclusion now follows from 
Theorem 22.1.6. ■ 


Let K be a field of characteristic p > 0. Let K p = {a p | a £ K}. The reader 
is asked to verify in Exercise 7 (page 490) that K p is a subfield of K. 

Definition 22.1.10 Let K be a field. Then K is called perfect if every alge¬ 
braic extension of K is separable. 

Example 22.1.11 By Corollary 22.1.7, every fi,eld, of characteristic 0 is per¬ 
fect 

The following theorem gives a necessary and sufficient condition for a field 
to be perfect. 

Theorem 22.1.12 Let K be a fi.eld of characteristic p > 0. Then K is perfect 
if and only if K = K p . 


Proof. Suppose K is perfect. Let a £ K and F be a splitting field of 
x p — a £ X[a:]. Then F/K is a separable extension. Let b £ F be a root of 
x p — a. Then 

x p - a = (x - b) p . 


Let p(x) £ K[x\ be the minimal polynomial of b. Thenp(x) has distinct roots. If 
degp(x) > 1, then since p(x)\(x — 6 ) p , p(x) has multiple roots, a contradiction. 
Hence, degp(x) = 1. This implies that b £ K. Hence, a = bP £ K p . Thus, 


K = K p . 
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Conversely, suppose K = K p . Let F/K be an algebraic field extension. 
Let a G F and /( x) £ K[x\ be the minimal polynomial of a. Suppose f(x) is 
not separable. Then by Theorem 22.1.9, f(x) = g{x p ) for some g{x ) £ K[x\. 
Hence, 

f{x) = ao + a\x p +-b a k x pk , 

di £ K, 1 < i < k. Since K = K p , a* = b? for some 6* £ K, 1 < i < k. 
Therefore, 

f(x) = (6o + b\x H-b b k x k ) p , 

a contradiction, since /(x) is irreducible over iC Hence, /(x) is separable. 
Thus, F/K is a separable extension. Consequently, K is perfect. ■ 

Example 22.1.13 Let K be a finite field of characteristic p. Define cr : K —> 
K p by cr(a) = a p . Then a is a. hom,om,orphism. Suppose that cr(a) = cr(b). Then 
a p — bP and, so (a — b) p =■ 0. Since K is a fi,eld, K has no nonzero nilpotent 
elements. Thus, a = b and, so a is one-one. Hence, \K\ = \a(K)\ < \K P \ < \K\ 
and, so \K\ — \K P \. Since K p is a subfield of K a,nd, K is finite , K = K p . Hence, 
K is perfect. We have thus shown that every finite field is perfect. 

If p(x) = ko + kix + k 2 X 2 + • • - + k n x n is irreducible and inseparable over 
K in Theorem 22.1.9, then p(x) — k$ + k p x p + •• ■ + k prn (x p ) Tn — q(x p ). It 
may be the case that p(x) — q(x p ) = s(x p ) in K[x p ]. However, there exists a 
largest positive integer e such that p(x) — t{x p ) for some t{x p ) £ K[x p ]. If 
n = deg p{x), then p e \n. 

Definition 22.1.14 Let K be a fi,eld, of characteristic p > 0 and p{x) be an 
irreducible polynomial in K[x\. Let e be the largest nonnegative integer such 
that p{x) — q(x p£ ) for some q(x pS ) € K[x pe ). Then e is called the exponent 
of inseparability of p{x ) and p e is called, the degree of inseparability of 
pix). If n denotes the degree of p{x), then uq = is called, the degree of 
separability or reduced degree of p{x) over K. 

By Theorem 22.1.9, p(x) in Definition 22.1.14 is separable if and only if 

e = 0. 

Theorem 22.1.15 Let K be a fi,el,d of characteristic p > 0 and 

p(x) = k no (x pe ) no H-b k\x pe + k 0 

be an irreducible polynomial in K[x], where e is the exponent of insepambility 
of p(x). Then the polynomial 

s{y) = k no y n ° -4--b kiy + k 0 6 K[y] 

is irreducible and separable over K. 
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Proof. If s(y) = f(y)g(y) € K[y], then p(x) = f(x p£ )g{x pe ), contrary to 
the fact that p(x) is irreducible in K[x\. Thus, s(y) is irreducible in K[y]. If 
s(y ) = q(y p ) for some q(y p ) E K[y p ], then p(x) = q(x pe+1 ), contrary to the 
maximality of e. Hence, s(y ) is separable. ■ 

Example 22.1.16 Consider the polynomial p(x) = x 2p + tx p + t over the 
field K{t ), where K is a, fi,eld of characteristic p > 0 and t is transcendental 
over K. By Eisenstein’s criterion, p{x) is irreducible over K{t). Now p(x) = 
( x p ) 2 +tx p +t E K(t)[x] a.nd, so p(x) is inseparable over K(t). The inseparability 
exponent e of p{x) equals 1. Thus, x 2 + tx + t is separable over K{t). 

Definition 22.1.17 Let F/K be a field extension. F is called a simple ex¬ 
tension if F — 7T(a) for some a E K. Such an element a is called a primitive 
element. 

Theorem 22.1.18 Let K be an infinite field and K{a,b)/K be a field exten¬ 
sion with a algebraic over K and. b separable algebraic over K. Then there exists 
an element c E K(a,b) such that K(a : b ) = AT(c), i.e., K(a,b)/K is a simple 
extension. 


Proof. Let f(x) and g{x) be the minimal polynomials of a and b over 
K with degrees n and m and roots a = < 21 ,^ 2 , •••, o . n , and b — 
bm , respectively, in some extension field of K. Since b is separable, all bfis are 
distinct. Also, since K is infinite, there exists s E K such that a + sb ai~\~sbj, 
i.e., 

ai — a 




b~bj 


for all 1 < i < n, 1 < j < m. Let c = a + sb. Then c — sbj ai for all 1 < i < n, 
1 < j < m,. Also, K{c ) C K(a,b). Let h{x) — f(c — sx) E K{c)[x\. Now 


h{b) = f (c - sb) = /(a) = 0. 


Thus, g{x) and h{x) have the common root b of multiplicity 1 in the field 
K(a,b). Now 

h{bj) - f(c - sbj) ± 0 


for all 1 < j < m. Thus, g(x) and h(x) have only root b in common. Let 
d(x) E K(c ) [x] be the greatest common divisor of g{x) and h(x). Then b is a root 
of d(x). Every root of d(x) is also a root of g(x) and h(x). Since g(x) and h(x) 
have no roots other than b in common in any field and b is of multiplicity 1, d{x) 
is of degree 1. Hence, d(x) = x — b. But then b E K{c). Thus, a = c—sb E K(c). 
Therefore, K(a,b) C K(c) C K(a,b) and so K(c)= K(a,b). ■ 
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Corollary 22.1.19 Let K be an infinite field. Let ai,a 2 ,... ,a n be elements 
in som.e field containing K. Suppose tha.t ai is algebraic and a 2 ,...,a n are 
separable algebraic over K. Then there exists an element c E K(ai,... y a n ) 
such that K(c ) = K(a \,..., a n ), i.e., K(a\,... ,a n )/K is a simple extension. 

Proof. The result follows by induction on n and Theorem 22.1.18. ■ 

Corollary 22.1.20 Let F/K be afield extension and the characteristic of K 
be 0. Let a\ y < 22 ,..., a n E F be algebraic over K. Then K(a\,... ,a n )/K is a, 
sim.ple extension. 

Proof. The proof follows by Corollaries 22.1.7 and 22.1.19. ■ 

Example 22.1.21 Consider Q(v / 2 ,T). Now 1 ^. Thus, 

Q(\/2, i) — Q(\/2 + i) by the proof of Theorem. 22.1.18, with s = 1 there. 

Theorem 22.1.22 (Artin) Let K be an infinite field. Let F/K be a finite 
field extension. Then F/K is a sim,ple extension if and only if there are only 
a finite number of interm.edia.te fields of F/K. 

Proof. Suppose F/K is a simple extension. Let F = K(a) for some oGf. 
Let L be an intermediate field of F/K and f(x) be the minimal polynomial of 
a over L. Let V be the field generated by K and the coefficients of fix). Then 
L' CL and f{x) is also the minimal polynomial of a over V. Hence, 

{F : L] = deg f(x) = [F:L']. 

Thus, [L : L'] = 1 and so L — V. Let g{x) be the minimal polynomial of a 
over K. Then f{x) divides g(x). Now gix) has only a finite number of distinct 
monic factors. Hence, the number of intermediate fields is finite. 

Conversely, suppose there are only a finite number of intermediate fields of 
F/K. Let a, 6 E F. We first show that K(a,b)/K is a simple extension. Let 
c £ K and F c = K (a + cb ). Then for all c E K, F c is an intermediate field of 
K(a,b)/K. Since the number of intermediate fields is finite and K is infinite, 
there exists c,d E K, c d such that F c — Fd- Then 

b — (c — d)~ 1 (a + cb — a — db ) E F c . 

Hence, a = a + cb — cb E F c . Thus, K(a, b) = F c = K(a + c6), i.e., K(a , b)/K 
is a simple extension. Now for all a E F, K(a ) is an intermediate field of F/K. 
Since [F : K] is finite, [K(a) : K] is finite. Let 

A = {[K(a)-.JO |osn 
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Then A is a finite subset of Z. Let a E F be such that the maximum of 
A = [ K(a) : K ]. Suppose F ^ K(a). Let b E F be such that b K(a). Then 
K(a) C K(a,b). There exists c E F such that K{a,b) = K(c). Therefore, 
K(a) C X(c). Hence, [ K(c ) : jRT] ^ [-ftT(ci) ; /C], a contradiction to the maximal¬ 
ly of [K(a) : K ]. Consequently, F = K (a), i.e., F/K is a simple extension. ■ 

Let F/K be a field extension. In the next chapter, we show that every 
finite extension of a finite field is a simple extension (Corollary 23.1.8, page 
494). Hence, from this and Theorem 22.1.22, it follows that F/K is a simple 
extension if and only if there are only a finite number of intermediate fields of 
F/K. 

We now focus our attention on the study of separable algebraic and 
purely inseparable extensions. 1 

Theorem 22.1.23 Let K be a field of characteristic p > 0 and f(x) — x pe — k 
be a polynomial over K , where e is a, positive integer. Then f{x) is irreducible 
over K if and only if k £ K p . 


Proof. Suppose f{x) is irreducible over K. If k = k lp E K p for some k' E K , 
then f(x) — ( x pe — k') p , contrary to the fact that f{x) is irreducible over 
K. Hence, k £ K p . Conversely, suppose k ^ K p . Let p(x) be a nonconstant 
monic irreducible factor of f{x) in K[x\ and c be a root of p{x). Then c is a 
root of f{x) and so c p£ = k and f{x) — (x — c) p£ over K(c). Since K{c)[x\ is a 
unique factorization domain, it follows that p(x) is some power of (x — c), say, 
p(x) = (x — c) m . Thus, mn = p e for some n so that m — p T and n = p s for 
nonnegative integers r and s. Therefore, p(x) = x pT —c pV in X[x]. If s > 0, then 
k — c pe — (c pT ) pS E K pS C K p , which is contrary to the assumption k ^ K p . 
Thus, s = 0 and so r — e. Hence, p{x) = f(x), i.e., f(x) is irreducible. ■ 


Definition 22.1.24 Let F/K be a field extension of characteristic p > 0. Let 
c E F be a root of the irreducible polynomial p(x) in K[x\. If the degree of 
separability uq of p(x) equals 1, then c is said to be purely inseparable over 
K. If every element of F is purely inseparable over K , then F/K is called a. 

purely inseparable extension. 


In Theorem 22.1.15, let c be a root of p(x). Then c p£ is a root of s(y). 
We have K(c) D K(c p ) D K and c is a root of the polynomial x p — c p 
over K(c pC ). It follows that x pS — c pC is irreducible over K(c pe ), K(c)/K(c pe ) 
is purely inseparable, and K(c pS )/K is separable. 


1 The remainder of this section may be skipped without any discontinuity. The only place 
this material is needed is in Example 24.2.8. 
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Theorem 22.1.25 Let F/K be a field extension of characteristic p > 0 and c 
be an elem,ent of F. Then c is purely inseparable over K if and only if c p ™ E K 
for some nonnegative integer m. 


Proof. Let c be purely inseparable over K. Then the degree of separability 
no °f the minimal polynomial p(x) of c equals 1. Thus, p{x) — x p£ + k in K[x], 
where e is the exponent of inseparability of pix) over K. Therefore, c pe + k = 0 
or c p£ = —k E K. Hence, we can take m = e. Conversely, suppose cP m E K. Let 
e be the smallest nonnegative integer such that c pe E K. Then c is a root of 
the polynomial x p£ — k over iL, where k = c pB . If x p£ — k is not irreducible over 
K , then e > 0 and k = k' p for some k' E K by Theorem 22.1.23. In this case, 
x pe —k— ( x pS 1 — k') p . Thus, (c pe 1 — k') p — 0 and since a field has no nonzero 
nilpotent elements, c pB — k! — 0 or c pC = k! E K. However, this contradicts 
the minimality of e. Thus, x p& — k is irreducible over K. Clearly the degree of 
separability of x p — k is 1. Therefore, c is purely inseparable over K. ■ 


Corollary 22.1.26 Let F/K be a field extension of characteristic p > 0 a.nd 
c E F. 

(i) If c is algebraic over K, then c is purely inseparable over K if and only 
if the minimal polynomial of c over K is x pS — c pe , where e is the smallest 
nonnegative integer such that c p£ E K. 

(a) If c is purely inseparable over K , then [K(c ) : K] = p e for som.e non¬ 
negative integer e. 

(Hi) If c is purely inseparable and separable algebraic over K , then c E K. 


Proof. The proof of (i) follows from Theorem 22.1.25. Statement (ii) is an 
immediate consequence of statement (i). For the proof of statement (iii), we' 
see that since c is purely inseparable over K the minimal polynomial of c over 
K has the form x pS — k. Since c is separable algebraic over K , the exponent of 
inseparability of x p — k is 0, i.e., e = 0. Thus, x — k is the minimal polynomial 
of c over K , whence c = k E K. ■ 


Corollary 22.1.27 Let F/K be a field, extension of characteristic p > 0. 

(i) If F — K(M) for som.e subset M of F such that every element of M is 
purely inseparable over K , then F/K is a purely inseparable extension. 

(ii) Let L be an intermediate fi,eld of F/K. Then F/K is purely inseparable 
if and only if F/L and L/K are purely inseparable. 

(iii) The set of all elements of F which are purely inseparable over K is an 
intermediate field of F/K. 
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Proof. (i) Let c be an element of F. Then there exists a finite subset 
{mi,m 2 ,..., m s } of M such that 

where here we are using the fact that F = K[M] since F/K is necessarily an 
algebraic extension. Let e = max{ei,..., e s }, where ei is a nonnegative integer 
such that 1 G K, i = 1,..., s. Then 

c?e = C, S K> • * • ( m f Y s e k. 

hi--, U 

Hence, c is purely inseparable over K. 

(ii) Suppose that F/K is purely inseparable. Let c G F. Then there exists 
a nonnegative integer e such that c p G K and so c p£ G L. Thus, F/L is purely 
inseparable. L/K is purely inseparable since every element of L is an element 
of F. Conversely, suppose F/L and L/K are purely inseparable. Let c G F. 
Then there exists a nonnegative integer m such that c pm G L. Since L/K is 
purely inseparable, there exists a nonnegative integer n such that (c pm ) p ” G K. 
Therefore, c p ™ G K so that c is purely inseparable over K. 

(iii) Let J denote the set of all elements of F which are purely inseparable 
over K. Then K C J and so J ft fi. Let c, d G J. Then c pe G K and d pf G K 
for some nonnegative integers e and /. Let n = max{e,/}. Then (c — d) p = 
c P n _ d pn G K. Hence, c - d G J. If d ft 0, then ( cd~ l ) pn = c pn (d pn )- 1 G K. 
Thus, cgF 1 G J. Hence, J is an intermediate field of F/K. ■ 


Theorem 22.1.25 and Corollary 22.1.27(i) make it quite easy to construct 
examples of purely inseparable field extensions. 


Example 22.1.28 Let J be any field of characteristic p > 0; e.g., J = Z p . 
Let F — J(x, y, z ), where x, y, z are algebraically independent over J. Set 
K = J{x p , y p , z p ). Then F/K is purely inseparable since x,y,z are purely 
inseparable over K. It can be shown that [F : K] = p 6 since x, y, z are alge- 

2 L 3 J 3 

braically independent over J. Since x p , y p , z p G K , we hare F p C K. 

For any field F of prime characteristic p, Ff F p * is a purely inseparable field 
extension for any nonnegative integer e. 

The following example is essentially the same as that in Example 22.1.28. 


Example 22.1.29 Let J be any field of characteristic p > 0. Let K = J(x, y, 

z), where x, y, z are algebraically independent over J. Let F = J(a, b, c), where 

2 

a is a root of the polynomial t p — x over K , b is a root of the polynomial t p —y 
over K (a), and c is a, root of the polynomial tP — z over K(a,b). Then F/K 
is purely inseparable, [F : K] = p 6 , a.nd, F p C K. One often writes a — x p , 


r 


.-2 


an 


id c = z p 


-3 
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Example 22.1.30 Let J be any field of characteristic p > 0. Let K = J(t), 
where t is transcendental over J. Let F = K{t p \ t p 2 , t p 3 ,...). Then F/K 
is purely inseparable by Corollary 22.1.27. Since [K(t p 1 , t p 2 , t p 3 , • • ■ ,£ p ™) : 
K(t p 1 , t p 2 , t p 3 ,...,t p n+1 )] — P for all positive integers n, [F : K] = oo. 
There does not exist a, positive integer e such that F p C K. 

Example 22.1.31 Let J be any field, of characteristic p > 0. Let K = J{x i, 
X 2 , £ 3 , ...), where X\, X 2 , £3, ... are algebraically independent over J. Let Fq = 

—1 —2 —3 

K(x\ , x% , £3 Then Fq/K is purely inseparable and [Fo : K] — 00 . 

—2 —2 —2 

Lei F\ = K(x p , £ p , £3 Then F\/K is purely inseparable, [jF\ : K] = 

00 , and Ff C K. 

We now turn our attention to separable extensions. 

Theorem 22.1.32 Let F/K be a field extension of characteristic p > 0. If 
F/K is separable algebraic, then F = K(F P ). If [F : K] <00 and F = K(F P ), 
then F/K is separable algebraic. 


Proof. Suppose F/K is separable algebraic. Now every element of F is 
purely inseparable over F p and thus purely inseparable over K(F P ). Every 
element c of F is separable algebraic over K and thus separable algebraic 
over K(F P ). Thus, every element c of F is in K{F P ) by Corollary 22.1.26(iii). 
Hence, F C K(F P ), so that F = K(F P ). Conversely, suppose [F : K] < 00 and 
F = K(F P ). Let a be any element of F. Since [F : K] < 00 , a is algebraic over 
K. If a is not separable over K , then the minimal polynomial of a over K has 
the form 

(x p ) n + • * • + kix p + ko. 


Therefore, 0 = a np + ■ ■ ■ +kia p + ko ■ 1 with not all the k z — 0. Hence, 
l,a p ,...,a np are linearly dependent over K. By Theorem 21.1.14, 1, a, a 2 , 

..., a n , ..., a np ~ l are linearly independent over K, whence 1, a, a 2 , ..., a n are 
linearly independent over K. 

We now show that this is impossible by showing that whenever n elements 
b\,... ,b n of F are linearly independent over K, then the elements ..., 
are linearly independent over K. We can assume that 61 ,..., b n is a basis of 
F/K since any linearly independent set over K can be extended to a basis of 
F/K , in particular, the linearly independent set {l,a,... ,a n }. By Exercise 7 
(page 490), the mapping a : F —> F p defined by a(c) = c p for c € F is an 
isomorphism, which maps K onto K p . Thus, since &i,..., b n is a basis of F/K , 
frj, ...,&£ is a basis of F p /K p . Hence, 6 ^, ...,&£ spans F p over K p . Conse¬ 
quently, b j,..., &£ spans K(F P ) over K ; i.e., F over K. Since F has dimension 
n over K and the n elements span F over K, the elements tfi, ..., bP 
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must be a basis for F over K. ■ 


The field extension F/K of Example 22.1.30 shows that the finiteness con¬ 
dition [F : K] < oo cannot be dropped in the above theorem. We have 
F = K(F P ), F/K is not separable algebraic, in fact, F/K is purely insep¬ 
arable. 

Corollary 22.1.33 Let F/K be a field extension of characteristic p > 0. 

(i) Let a be an element of F. Then K(a) = K{a p ) if and only if K(a)/K is 
separable algebraic. 

(ii) Let ai, 02 ,..., a n be elements of F. Then K(a\,..., a n )/K is separable 
algebraic if and only if a\ is separable algebraic over K and, ai is separable 
algebraic over K{a i,..., a-i-i), i = 2, 3,... , n. 

Proof. (i) If K{a) = iT(a p ), then a cannot be transcendental over K and 
so a must be algebraic over K. By Theorem 22.1.32, K(a) = K(K(a) p ) if and 
only if K(a)/K is separable algebraic. We thus have the desired result since 
K{K(a) p ) = K(a p ). 

(ii) Suppose K{a \,..., a n )/K is separable algebraic. Then ai,..., a n are 
separable algebraic over K. By the discussion following Definition 22.1.2, ai 
is clearly separable algebraic over K{a i,..., a 2 _i), i = 2, 3 ,... ,n. Conversely, 
suppose ai is separable algebraic over K and a % is separable algebraic over 
K(a u ... ,a»_i),i = 2, 3,... ,n. TheniT(ai) = K(a\), ..., K(ai,.. .,a i ^ 1 )(a l ) = 
K(a i, ..., a i _i)(af > ), i = 2,3,..., n. Thus, K(a u ..., a n ) = K{a\, ..., a p ) = 
K([K(ai, ..., a^_i)] p ). The conclusion now holds from Theorem 22.1.32. ■ 

Corollary 22.1.34 Let F/K be a fi,eld extension of characteristic p > 0. 

(i) If F = K{M) for som.e subset M of F such that every elem,ent of M is 
separable algebraic over K, then F/K is separable algebraic. 

(ii) Let L be an interm.ediate field of F/K. Then F/K is separable algebraic 
if and only if F/L and L/K are separable algebraic. 

(Hi) The set of all elements of F which are separable algebraic over K is 
a,n intermediate field of F/K. 


Proof. (i) Let a <G F. There exists a finite subset {mi,..., m s } of M such 
that a 6 K(m\,... ,m s ). Since each mi is separable algebraic over IT, we have 
by Corollary 22.1.33(ii) that K{m \,... ,m s )/K is separable algebraic. Hence, 
a and thus F/K is separable algebraic. 

(ii) Suppose F/K is separable algebraic. Then F/L is separable algebraic 
by the discussion following Definition 22.1.2. L/K is separable algebraic since 
every element of L is an element of F. Suppose F/L and L/K are separable 
algebraic. Let a E F. Let co, ci,..., Cn £ L be the coefficients of the minimal 
polynomial p(x) of a over L. Since a is separable aDebraic over L, a is separable 



22 . 1. MULTIPLICITY OF ROOTS 


486 


algebraic over K(cq,ci } c n ). (p(x) is also the minimal polynomial of a 
over K(cq, ci, ..., c n ).) Since cq, ci, . .. , c n G L and L/K is separable algebraic, 
K(cq, ci, .. ., Cn)/K is separable algebraic by Corollary 22.1.33(h). Thus, a and 
so F is separable algebraic over K. 

(iii) Let S denote the set of elements of F which are separable algebraic 
over K. Then S T K. Let a,b G S. Then by Corollary 22.1.33(h), K{a, b)/K is 
separable algebraic. Since a — b G K{a,b) and (for b 0) ab~ l G K(a,b), a — b , 
and ab~ l (b ^ 0) are separable algebraic over K and thus are members of S. 
Hence, S' is a field. ■ 

Definition 22.1.35 Let FjK be an algebraic field extension of characteristic 
p > 0. Then the intermediate fi,eld of F/K consisting of all elements of F which 
are separable algebraic over K is called the separable closure of K in F or 
the maximal separable intermediate field, of F/K. We denote this field by 
K s . 

Theorem 22.1.36 Let F/K be an algebraic fi.eld extension of characteristic 
p > 0. Then F/K s is purely inseparable, where K s is the separable closure of 
F/K. 

Proof. If F — K s the theorem is immediate. Suppose F D K s . Let a G 
F,a^K s . Let 

pix) — ko + kix pe + ( x pe ) n ° 

be the minimal polynomial of F/K s , where e is the exponent of inseparabil¬ 
ity and no is the reduced degree of p{x) over K s . Now by Theorem 22.1.15, 
kg + k\y+- ■ ■-\-y n ° is the minimal polynomial of a pe over K s and this polynomial 
is separable over K s . Hence, a p is separable over K s . Thus, K s (a pe )/K s is sep¬ 
arable algebraic and so K s (a pS ) / K is separable algebraic. By the definition of 
K s , we have a p G K s . Therefore, a is purely inseparable over Ks- M 


We can think of field theory as being separated into two parts, namely, 
that in which the fields are of characteristic 0 and that in which the fields are 
of prime characteristic p. It can be shown that for any field extension F/K , 
there exists a subset X of F which is algebraically independent over K and 
which also has the property that F/K(X) is algebraic. The above theorem 
shows that the study of algebraic field extensions of characteristic p > 0 can be 
separated into two parts, the separable part and the purely inseparable part. 
Separable algebraic field extensions of characteristic p > 0 often act entirely 
similar to field extensions of characteristic Q. Purely inseparable field extensions 
have their own distinctive behavior. 


Definition 22.1.37 Let F/K be an algebraic fi,eld extension of characteristic 
p > 0. Then the degree \K S : K] is called the deqree of separability of 
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FjK and is denoted by [F : K] s . The degree [F : K s \ is called the degree of 
inseparability of F/K and is denoted by [F : K]i. 

Theorem 22.1.38 Let K be a. field, of characteristic p > 0 and p(x) an irre¬ 
ducible polynomial in K[x\. Let K{a ) be an extension of K obtained by adjoin¬ 
ing a root a of p{x) to K. Then 

[■ K(a ) : K} s = n 0 , 

[K(a) : K]i = p e , 

where no is the red,need, degree of p{x) over K a,ndp e is the degree of insepara¬ 
bility of p(x) over K. 

Proof. Let b £ K(a). Then b = ^^Zq 1 where n is the degree of p{x) 
over K and each ki £ K. Therefore, 


n— 1 

&P* = {a p ‘Y £ K(a?‘). 

i =0 

Thus, b is purely inseparable over K(a pS ). Hence, K(a)/K(a pe ) is purely insepa¬ 
rable. By the definition of the degree of inseparability of p(x) over K, K ( a pC )/K 
is separable algebraic. Now K s D K{a pe ). Let b £ K s . We have just seen 
that b is purely inseparable over K(a pe ). But b is also separable algebraic over 
K(a pC ). Therefore, b £ K(a pe ) so that K s — K(a pS ). By Theorem 22.1.15, 
the minimal polynomial of a p over K is of degree no and so [K{a) : K] s — 
[. K{a pe ) : K] = n 0 . Thus, n 0 p e - [K{a) : K] = [K(a) : K(a pS )}[K(a p *) : K] = 
[. K(a ) : K(a pe )]no. Consequently, p e = [F'(a) : K(a pe )] = [K{a) : Kfi. ■ 

Example 22.1.39 Let K denote the field Z p (n,n), where u a,nd, v are alge¬ 
braically independent over Z p . Let a be a root of the polynomial x 2p + vx p -j- u 
over K. By use of Worked-Out Exercise 6 (page 456,), one can deduce that 
x 2p + vx p + u is irreducible over K. Let F be the field K(a). We ask the reader 
to verify the following properties of the field extension F/K. K s = K(a p ), 
[F : K]i = Pi and [F : K] s = 2. Also, the extension F/K has no elements 
which are purely inseparable over K ( except those elem.en.ts which are already 
in K). Thus, if J is the interm,ed.ia.te field of F/K consisting of all the elements 
of F purely inseparable over K , then J — K. Hence, F/J is not separable al¬ 
gebraic. 


22.1.1 Worked-Out Exercises 

0 Exercise 1 Determine if the following polynomials are separable or insep¬ 
arable over the given fields. 
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(i) x 2 — 6 x 9 over Q; 

(ii) x 4 + x 2 + [ 1 ] over Z 2 . 

Solution: (i) x 2 — 62 ; + 9 = (x — 3 ) 2 over Q. Now x — 3 is irreducible over 
Q. Since x — 3 is separable over Q, x 2 — 6 x + 9 is separable over Q. 

(ii) x 4 + x 2 4 - [1] = ( x 2 + x + [l ]) 2 over Z 2 . Now x 2 + x + [1] has no roots in 
Z 2 . Hence, x 2 -fx-f[l] is irreducible over Z 2 . Now D x {x 2 + x + [1]) = [2]x + [l] = 
[ 1 ] ^ [ 0 ], Thus, x 2 + x + [1] and so x 4 + x 2 + [1] is separable over Z 2 . 

<0 Exercise 2 Prove that the following polynomials are irreducible over Z 3 (t), 
where t is transcendental over Z 3 . Find the exponent of inseparability and 
the degree of separability of the polynomials over Z $(t). 

(i) p(x) = x 36 + tx 18 + t. 

(ii) q{x) = x 24 + tx 18 + t. 

(iii) r(x) = x 20 T tx 18 + t. 

(iv) s(x ) = x 9 + t. 

Solution: Since t\t, /;|0, t /l, t 2 /£, the polynomials p(x), q(x), r(x), s(x) 
are irreducible over Z 3 (t). 

(i) p(x) = x 4 32 + tx 2-32 +1 and so the exponent of inseparability e = 2 and 
the degree of separability no = 4. 

(ii) q(x) — x 8 ' 3 + £x 6 ' 3 -f -1 and so the exponent of inseparability e = 1 and 
the degree of separability no == 8 . 

(iii) Since 3 /20, e = 0 and no = 20. 

(iv) Here e = 2 and no = 1. 

0 Exercise 3 Let f(x) and g{x) be polynomials over the field K. 

(i) Does /(c) — g{c ) for all c E K imply that f{x ) = g{x)l 

(ii) Does /(c) = 0 for all c £ K imply that /(x) = 0? 

Solution: (i) Let /(x) = [3]x 5 —[4]x 2 € Zs[x] and ^(x) = x 2 + [3]x G Zs[x]. 
Now /([0]) = [0] - 5 ([0]), /([l]) - [4] = g{[ 1]), /([2]) - [0] - g([ 2]), /([3]) - 
[3] = p([3]), /([4]) = [3] = g([ 4]). Hence, /(c) = g(c) for all c G Z 5 . However, 
f {x) =£ g(x). 

(ii) Let /(x) = x 2 + x G Z 2 [x]. Then /(c) = 0 for all c G Z 2 , but f{x) ^ 0. 

Exercise 4 Let K = P(x,y,z ) and F = K(z p 2 , z p 2 x p 1 + y p X ), where 
P is a perfect field of characteristic p > 0 and x, y , z are algebraically 
independent indeterminates over P. Prove that K p fl F = K(z p ), 
where K p ~ l = {k p ~ l | k G K}. 
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Solution: Clearly F D K p 1 fl F D K(z p 1 ). Now [F : K] = p 3 . Suppose 
that K p ~ l n Fd K(z p ~ 1 ). Then F = (K p ^ D F){z p ~ 2 ) since z p ~ 2 £ K p ~ l D F 
and [K p 1 n F : K] must be p 2 . Thus, [F : K p fl F] = p. Since [K p 1 (F) : 
K p *] = p, any basis of F/(K P 1 D F) remains a basis of K p 1 (F)/K p 1 . Now 
Z = {1, z p ~ 2 , ..., ( z p ~ 2 ) p - 1 } is a basis of F/{K p ~ l D F). Also, 

p -1 

Z^XP- 1 +y p ~ 1 =J2HZ P ' 2 )\ 

2 = 0 

where ki € K p i-0,1, ..., p — 1. Since Z remains linearly independent 

over K p 1 , y p 1 = ko G K p 1 fl F and x p 1 = k\ G K p 1 fl F. Therefore, 
x p 1 ,y p 1 £ T. Thus, [F : K] = p 4 , a contradiction. Hence, K p fl F = 
FT(zP _1 ). 

22.1.2 Exercises 

1. Let f(x) £ K[x\, a polynomial ring over a field K and c € F, where F is 
an extension field of K. Prove that (x — c) 2 \f(x) if and only (x — c)\f(x) 
and (x — c)lf'(x). 

2. Let f(x) G K[x], a polynomial ring over a field K. Use Exercise 1 to 
prove that f(x) has no repeated roots in any extension field of K if and 
only if f(x ) and f'(x) are relatively prime. 

3. Let f(x) = x n — x G K[x\, a polynomial ring over a field K. Suppose 
that n > 2 and that either K has characteristic 0 or a prime p such that 
p does not divide n — 1. Prove that f(x) has no repeated roots in any 
extension field F of K. 

4. Let f{x) = x p ~k £ K[x], a polynomial ring over a field K of characteristic 
p > 0. Prove that either f{x) is irreducible over K or that f(x) is a power 
of a linear polynomial in F’fx]. 

5. Determine if the following polynomials are separable or inseparable over 
the given field. 

(i) x 2 — 4x + 4 over Q. 

(ii) x 5 6 + tx +1 over Zs(t), where t is transcendental over Z 5 . 

6 . Prove that the following polynomials are irreducible over Z 5 (u), where 
u is transcendental over Z 5 . Find the exponent of inseparability and the 
degree of separability of the polynomials over Z 5 (u). 

(i) p{x) = x 250 + ux 125 + u. 

(ii) g{x) = x 128 + ux 125 + u. 

(iii) s(x) = x 125 + u. 
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7. Let F be a field of characteristic p > 0. Prove that for any nonnegative 
integer e, F pe is a subfield of F. Prove also that the mapping a : F —» F pe 
defined by a (a) = a p is an isomorphism. 

8 . Prove that a root of the polynomials in Examples 22.1.16 and 22.1.39 
is neither purely inseparable nor separable algebraic over K(t) and K, 
respectively. 

9. Let K(a)/K be a field extension of characteristic p > 0. Prove that 
(K{a))P = K p (a p ). 

10. Let F/K be a finite field extension of characteristic p > 0. If [F : K] is 
not divisible by p, prove that F/K is separable. 

11 . Let F/K be an algebraic field extension and S be an intermediate field of 
F/K such that F/S is purely inseparable and S/K is separable algebraic. 
Prove that S = K s . 

12. Let P be a perfect field of characteristic p > 0. Let P{a)/P be an algebraic 
field extension. Prove that P(a)/P is separable and that P(a) is perfect. 

13. Let K be any field of characteristic p > 0. Prove that Z p is the smallest 
subfield of K which is perfect and r\?2 0 K pt is the largest subfield of K 
which is perfect. 

14. Verify the properties of the field extension F/K of Example 22.1.39. 

15. Answer the following statements, true or false. If the statement is true, 
prove it. If it is false, give a counterexample. 

(i) Let F be a field of characteristic p > 0. Since F ~ F p and F p C F, it 
follows that F p = F. 

(ii) Let F/K be a field extension of characteristic p > 0. Let c G F\K. 
Then it is impossible for c to be both separable and purely inseparable 
over K. 

(iii) Let F/K be a field extension of characteristic p > 0. Let c € F. Then 
it is impossible for c to be both separable and inseparable over K. 
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Emil Artin (1898-1962) was born 
on March 3, 1898, in Vienna, Austria. In 
1916, he passed his school certification and 
after one semester of university work, he was 
called for military service. In January 1919, 
he resumed his studies at the University of 
Leipzig, where he was awarded the Ph.D. 
degree in 1921. 

Artin was appointed lecturer in 1923, 
became extraordinary professor in 1925, and 
became ordinary professor in 1926 at the 
University of Hamburg. In 1937, along with 
his family Artin emigrated to the United 
States. He taught for a year at the University of Notre Dame and from 1938 to 
1946 at Indiana University. In 1946, he joined Princeton University, and in 1958 he 
returned to the University of Hamburg, where he remained teaching until his death in 
1962. 

In 1927, Artin proved the general law of reciprocity, which included all the previous 
known laws of reciprocity until the time of Gauss. It has become the main theorem of 
class field theory. In 1961, he published, with John Tate, Class Field Theory. 

In 1926, in collaboration with Otto Schreier, Artin developed the theory of real- 
closed fields. The following year, with the help of the theorem on real-closed fields, 
he proved the Hilbert problem of definite functions. Also in 1927 he expanded the 
theory of algebras of associative rings. In 1928, Artin introduced the notion of rings 
with minimum condition. In his honor, these are called Artinian rings. 

During the 1930s Artin started to reformulate Galois theory, using techniques of 
linear equations. In 1942, he published Galois Theory, reformulating it in an abstract 
setting as a relationship of field extensions and the subgroups of its automorphism— 
the we see it today—away from the classical approach as permutations of roots of an 
equation. He was fascinated by Galois theory, and in a 1950 lecture he said, 

“Since my mathematical youth I have been under the spell of the classical theory 
of Galois. This charm has forced me to return to it again and again, and to try to find 
new ways to prove its fundamental theorem.” 

Artin contributed to various areas of mathematics, including number theory, group 
theory, ring theory, field theory, geometric algebra, and algebraic topology. He was 
awarded the American Mathematical Society’s Cole Prize in number theory. He died 
on December 20, 1962. 






Chapter 23 

Finite Fields 


The theory of finite fields has come to the fore in the last 60 years due to 
newfound applications. The applications of finite fields are in coding theory, 
combinatorics, switching circuits, statistics via finite geometries, and certain 
areas of computer science. 


23.1 Finite Fields 

A finite field (or Galois field) is a field with a finite number of elements. 
If F is a finite field, then F has prime characteristic p and contains a subfield 
isomorphic to Z p . Since F has only a finite number of elements, [F : Z p ] < oo. 

We denote a finite field of n elements by GF(n). We will show in the next 
result that n must be a power of p. The result is due to E.H. Moore (1862- 
1932). The United States is indebted to Moore for its beginnings in abstract 
algebra and for its initial international recognition in research. 

Theorem 23.1.1 If F is a finite fi,eld of characteristic p and n = [F : Z p ], 
then F contains p n elements. 

Proof. Since [F : Z p ] = n, F/Z p has a basis of n elements, say, b\, & 2 5 • •., b n . 
Every element a of F is a linear combination of b\, 62 , ..., 6 n , i.e., a = a\bi + 

0 , 2^2 H-ba n 6 n , where E Z p , i = 1, 2, ..., n. Now Z p has p elements. Hence, 

F has at most p n elements. Since {&i, 62 ,..., b n } is linearly independent over 
Z p , a\b\ + a 2&2 + • • • + a-nb-n is distinct for every choice of < 21 , 02 ,..., a n . Thus, 
F has exactly p n elements. ■ 

Theorem 23.1.2 Every element of a finite field F of chara,cteristic p and of 
p n elements is a root of the polynomial x pU — x E Z p [x\. Moreover, F is a. 
splitting field of x pn — x over Z p . 
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Proof. First note that (F\{0}, •) is a commutative group of order p n — 1 . 
Thus, for all a G F\{0}, a p " -1 = 1, whence a pn = a. Clearly 0 pn = 0. Since 
F contains all the roots of x p — x, F contains a splitting field S of x pn — x 
over Tip. However, F is exactly the set of all the roots of x pn —x and so F — S. M 

In the following result, we once again use a positive integer and the concept 
of an isomorphism to completely characterize an algebraic structure. 

Corollary 23.1.3 Any two finite fields of p n elements are isom,orphic, where 
p is a prime and, n is a positive integer. 


Proof. If F and F' are finite fields with p n elements, then they are splitting 
fields of the polynomial x pU — x over Z p . Hence, F ~ F'. ■ 

The next theorem can be used to show that there exists an irreducible 
polynomial of arbitrary degree n over Z p . (See Exercise 8 , page 497.) Even 
though its proof is not constructive in nature, it is informative for certain 
applications. Exercises 5 and 6 can be used to actually count the irreducible 
polynomials of a given degree. There is an algorithm which can be used to 
test the irreducibility of a polynomial over a finite field—namely, Berlekamp’s 
algorithm. This algorithm is discussed in Isaacs. 

Theorem 23.1.4 For any prim,e p , there exists a fi.eld extension FfZ p of ar¬ 
bitrary finite degree n. 


Proof. Let S be the splitting field of the polynomial /(x) = x p ™ — x over 
Z p . Let a G S be a root of f(x) of multiplicity m. Then 

/(x) = (x - a) m g(x), 

where a is not a root of g(x). Now 

-I = fix) = {x- a) m_1 [mg(x) + (x - a)g'(x)\. 

This implies that (x — a) m_1 divides —1, whence m — 1 = 0. Thus, every root 
of /(x) in S has multiplicity 1. Hence, /(x) has p n distinct roots in S. Let F 
denote the subset of S, which consists of all roots of /(x). Let a, 6 G F. Then 
(a - b) pn =a pn -bP n = a - b. Therefore, a - b G F. For b ^ 0, 

{ab~ l ) pn = a pU [b pn )~ l = ab~ l G F. 

Thus, F is a subfield of S. Since F contains all the roots of /(x) and S is 
generated by the roots of /(x) over Z p , F = S. By Exercise 6 (page 497), 
[F : Zp] = n. ■ 

Theorem 23.1.5 Let F be a field and G be a finite subgroup of the multiplica¬ 
tive group F* =■ F\{0}. Then G is cyclic. 
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Proof. Since G is a finite Abelian group, G is a direct product of cyclic 
subgroups C u C 2i • Cfc, where \Ci\ = n*, ni > 1, and n 2 |n* + i, 1 < i < ft, 
by Theorem 9.1.7. From this it follows that p nfc = 1 for all g £ G. Thus, every 
element of G is a root of x Uk — 1 € F[x]. Since x Uk — 1 has at most distinct 
roots in F, |G| < rife. Now Ck is a subgroup of G and \Ck\ = n*.. Hence, G = Ck 
and so G is cyclic. ■ 

The following corollary is an immediate consequence of Theorem 23.1.5. 

Corollary 23.1.6 The multiplicative group of a finite fi.eld is cyclic. ■ 

Theorem 23.1.7 Let F be a finite field and F(a, b)/F afield extension with 
a, b algebraic over F. Then there exists c € F(a, b) such that F(a, b) = F(c), 
i.e. f F(a,b) is a simple extension. 

Proof. Since F(a, b)/F is algebraic, [F(a,b) : F] < oo. Thus, F{a, b) is 
a finite field since F is a finite field. Since F(a,b)\{ 0} is a cyclic group with 
some generator, say, c by Theorem 23.1.5, it follows that F(a, b ) — F(c). ■ 

Corollary 23.1.8 Every finite extension of a finite field, is simple. ■ 

23.1.1 Worked-Out Exercises 

0 Exercise 1 Prove that x 3 + x + [1] is irreducible in Z 2 [^c] - Write out the 
addition and multiplication tables for the field 

Z 2 [a:]/ (x 3 + x + [ 1 ]^> . 

Find a splitting field S\ for x 3 + x + [ 1 ] over Z 2 . Find a basis for S 1 /Z 2 
and [Si : Z 2 ]- 

Solution: x 3 + x + [1] is irreducible over Z 2 if and only if Z 2 contains no 
root of x 3 T x T [1]. Since [0 ] 3 + [0] + [1] 7 - [ 0 ] and [l ] 3 + [1] + [1] 7 ^ [0] in Z 2 , 
Z 2 contains no roots of x 3 + x + [1] over Z 2 . Hence, x 3 + x + [1] is irreducible 
over Z 2 . By Theorem 21.1.11, 

Z 2 W/ (x 3 + x + [1]^ = Z 2 (A), 

where A denotes the coset x + (x 3 + x + [1]). By Theorem 21.1.14, 

22 (A) = {[0], [1], A, A 2 , [ 1 ] + A, [ 1 ] + A 2 , A + A 2 , [1] + A + A 2 }. 

x 3 + x + [ 1 ] = (x + X)(x 2 + Xx + [ 1 ] + A 2 ) 


Now 
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and A 2 and A + A 2 are the roots of x 2 + Ax + [1] + A 2 . Since A 2 , A + A 2 E Z 2 (A), 
Z 2 (A) is a splitting field of x 3 + x + [l] over Z 2 . Let Si = Z 2 (A). Then {[ 1 ], A, A 2 } 
is a basis for Si/Z 2 and [S 1 : Z 2 ] = 3. Let a denote [1] + A + A 2 . The addition 
table for Z 2 (A) is given below. 


+ 

[0] 

[1] 

A 

A 2 

[1]+A 

[1]+A 2 

A+A 2 

a 

[0] 

[0] 

[1] 

A 

A 2 

[1] + A 

[1]+A 2 

A+A 2 

a 

[1] 

[1] 

[0] 

[1]+A 

[1]+A 2 

A 

A 2 

Q 

A+A 2 

A 

A 

[1]+A 

[0] 

A+A 2 

[1] 

a 

A 2 

[1]+A 2 

A 2 

A 2 

[1]+A 2 

A+A 2 

[0] 

Q 

[1] 

A 

[1]+A 

[1]+A 

[i]+A 

A 

[1] 

a 

[0] 

A+A 2 

[1] +A 2 

A 2 

[l]+A 2 

[1]+A 2 

A 2 

a 

[1] 

A+A 2 

[0] 

[1]+A 

A 

A+A 2 

A+A 2 

a 

A 2 

A 

(1]+A 2 

[1]+A 

[0] 

[1] 

a 

a 

A+A 2 

[1]+A 2 

[1] +A 

A 2 

A 

[1] 

[0] 


For the multiplication table, we make a few entries, such as ([ 1 ] + A)([ 1 ] + A) = 
[ 1 ]+A 2 and ([ 1 ]+A+A 2 )([ 1 ]+A 2 ) = [1]+A+A 3 +A 4 . We now reduce [ 1 ]+A+A 3 +A 4 
to the form a + 6 A + cA 2 , where a, b, c E Z 2 . We divide x 4 + x 3 + x + [1] by 
x 3 + x + [1] to obtain x 4 + x 3 + x + [ 1 ] = (x + [l])(x 3 + x + [ 1 ]) + x 2 + x. Thus, 
A 4 4- A 3 + A + [1] - (A + [ 1 ])(A 3 + A + [1]) + A 2 + A = [ 0 ] + A 2 + A. Hence, 
([1] + A + A 2 )([l] + A 2 ) = A + A 2 . 

<0 Exercise 2 Prove that x 3 + x 2 + [1] is irreducible in Z 2 [x]. Write out the 
addition and multiplication tables for the field 

Z 2 [x]/ (x 2, + x 2 + [l]) . 

Find a splitting field 5 2 for x 3 + x + [ 1 ] over Z 2 . Find a basis for 5 2 /Z 2 
and [S 2 : Z 2 ]. Compare your results with those in Worked-Out Exercise 
1 . 


Solution: Since [0 ] 3 + [0 ] 2 + [ 1 ] [0] and [ 1] 3 + [l ] 2 + [ 1 ] 7 ^ [0] in Z 2 , Z 2 

contains no roots of x 3 + x 2 + [1] over Z 2 . Hence, x 3 + x 2 + [1] is irreducible 
over Z 2 . By Theorem 21.1.11, 

Z 2 [*]/ (z? + x 2 + [1]^> = Z 2 (/x), 

where /j denotes the coset x + (x 3 + x 2 + [ 1 ]). By Theorem 21.1.14, 

Z 2 (m) = {[0], [1], T*■> [1] + [1] + M 2 , H + y?, [1] + y + y 2 }- 


Now x 3 + x 2 + [1] — (x + y){x 2 + ([1] + /T)x + fi + y 2 ) and y 2 and [1] + y + y 2 
are the roots of x 2 + ([1] + y)x + y + y 2 . Since y 2 , [1] + y + y 2 E Z 2 (p,), 
Z 2 (/x) is a splitting field of x 3 + x 2 + [ 1 ] over Z 2 . Let S 2 = Z 2 (/i). Then 
{[1], /x, ytx 2 } is a basis for S 2 /Z 2 and [5 2 : Z 2 ] = 3. The addition table for 
Z 2 (^.) is determined in a manner similar to that in Exercise 1. In fact, one may 


obtain the addition table by substituting u for A 


m the addition table of Zo(X). 
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We now consider multiplication. We note that ([ 1 ] + /x)([l] -f fi) = [ 1 ] -f- p, 2 . 
However, ([1] + fiF p. 2 )([l] + fi 2 ) — [1] + /i. + /i 3 +/r 4 = [1]. Hence, we note the 
first algebraic difference between Z 2 (A) and Z 2 (/i). 

0 Exercise 3 Show that there exists an isomorphism / of Z 2 (A) onto 

considered as vector spaces over Z 2 such that / is the identity on Z 2 
and /(A) = /*, /(A 2 ) = /x 2 , where A and /j. are as defined in Worked-Out 
Exercises 1 and 2, respectively. 

Solution: {[1], A, A 2 } is a basis for Z 2 (A) over Z 2 and {[ 1 ], /j,, /x 2 } is a basis 
for Z 2 (/x) over Z 2 . Hence, there exists a unique linear transformation / of 
Z 2 (A) onto Z 2 (/x) such that /([l]) = [ 1 ], /(A) = /x, and /(A 2 ) = /x 2 . This linear 
transformation is given by 

/(a[l] + bX + cA 2 ) = a[l] + bji + cfi 2 , 

where a, fe,c£ Z 2 . Since {[1], /lx, /x 2 } is linearly independent, / is one-one. 

0 Exercise 4 Show that Z 2 (A) and Z 2 (/x) are isomorphic as fields, where A 
and /x are as defined in Worked-Out Exercises 1 and 2 , respectively. 

Solution: Since |Z 2 (A)| = 2 3 = |Z 2 (m)J, Z 2 (A) and Z 2 (/x) are splitting 
fields of x 8 — ;r over Z 2 and thus are isomorphic. 

0 Exercise 5 Factor the polynomial x 8 — x over Z 2 . 

Solution: x 8 —x = x(x+ [l])(x 6 +x 5 +x 4 -|-x 3 -|-x 2 +x + [1]). Now x 2 + x-)-[l] 
is the only irreducible quadratic polynomial over Z 2 . But x 2 + x + [1] does not 
divide x 6 + x 5 + x 4 + x 3 + x 2 + x + [ 1 ]. We have that x 3 + x + [ 1 ] and x 3 + x 2 + [1] 
are irreducible polynomials over Z 2 and x 6 + x 5 + x 4 + x 3 + x 2 + x + [ 1 ] = 
(x 3 +x-(-[l])(x 3 +x 2 + [1]). Hence, x 8 —x = x(x+[l])(x 3 +x+[l])(x 3 -(-x 2 + [l]). 

<) Exercise 6 Find the roots of x 3 + x 2 + [1] in Z 2 (A), where A is as defined 
in Worked-Out Exercise 1 . 

Solution: [0] is a root of x, [ 1 ] is a root of x + [1], and A, A 2 , A + A 2 are 
roots of x 3 + x + [ 1 ]. Hence, [1] + A, [ 1 ] + A 2 , and [1] + A + A 2 are roots of 
x 3 + x 2 + [ 1 ]. 

<0> Exercise 7 Find the roots of x 3 + x + [ 1 ] in Z 2 (fj), where // is as defined in 
Worked-Out Exercise 2. 

Solution: [0] is a root of x, [1] is a root of x + [1], and fi, /r 2 , [1] + fi + /i 2 are 

roots of x 3 -fx 2 + fll. Hence, fll + u, ill4-a 2 , and ll+ u 2 are roots of x 3 + x+ 111. 
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0 Exercise 8 Show that there exists an isomorphism g of Z 2 (A) onto Z 2 ([1] + 
y) such that g( A) — [1] + y, where A and y are as defined in Worked-Out 
Exercises 1 and 2, respectively. 

Solution: The result here follows immediately by Corollary 21.2.9. 

<0> Exercise 9 Show that there does not exist an isomorphism h of Z 2 (A) onto 
Z 2 (/^) such that h( A) = y, where A and y are as defined in Worked-Out 
Exercises 1 and 2, respectively. 

Solution: Suppose there exists an isomorphism h of Z 2 (A) onto 
such that h{ A) = y. Then [0] = h([0]) = h( A 3 + A + [1]) = y 3 + y + [1]. Also, 
[0] — /i 3 + y 2 + [1]. Hence, y? 4 - y + [1] = /x 3 + y? + [1]. Thus, y? — y. Therefore, 
y — [1], a contradiction. 

23.1.2 Exercises 

1. Let F be a finite field. A generator for F* = F\{0} is called a primitive 
element for F. Find a primitive element for the following fields. 

0 ) Z 7 . 

(ii) Zn. 

(iii) F, where F D Z 2 and [F : Z 2 ] = 8. 

2. Construct a field with 9 elements. 

3. Construct a field with 27 elements. 

4. Suppose that F is a finite field of characteristic p. If c is a primitive 
element of F, prove that c p is a primitive element of F. 

5. Let F be a finite field of characteristic p. If n = [F : Z p ], prove that there 
exists c G F such that c is algebraic of degree n over Z p and F = Z p (c). 

6. If F is a finite field of p n elements, p a prime and n a positive integer, 
prove that [F : Z p ] = n. 

7. Describe the splitting field of x 32 — x over Z 3 . 

8 . Prove that there exists an irreducible polynomial of arbitrary degree n 
over Z p . 

9. If F is a subfield of GF(p n ), prove that F ~ GF(p m ), where m|n. 

10. Show that if m, and n are positive integers such that m|n, then GF{jp n ) 
contains a unique subfield GF(p m ), p 171 —1 divides p n — l, whence x pTn ~ l — 1 

divides x pn ~ 1 — 1 and so x pm — x divides x pn — x. 

~PuFuc- 
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11. Let F be a field containing Z p and f{x) be a polynomial over Z p . If c e F 
is a root of f(x), prove that c p is also root of f{x). 

12. Let f(x) — x p - x — [1] £ Z p [x\. Show that a splitting field of f(x) over 
Z p is Z p (c), where c is a root of f(x). 

13. Let F be a field and G and H be subgroups of F*. If G and H have order 
n, prove that G — H. 

14. If F is a field such that F* is cyclic, prove that F is finite. 


“Public. 
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Evariste Galois (1811-1832) was bo¬ 
rn on October 25, 1811 in Bourg-la-Reine, 
near Paris, France, into a well-educated fam¬ 
ily. Galois received his early education from 
his mother. His father was director of a 
school. He read Legendre’s Geometrie at a 
very young age and mastered it in one read¬ 
ing. He then read Lagrange’s work, acquir¬ 
ing a solid background. In 1828, he started 
reading recent works on the theory of equa¬ 
tions and the theory of elliptic functions. 

Galois twice failed the entrance exami- 
/ 

nation for the Ecole Poly technique. In 1829, 
he took the entrance examination for the 
Ecole Normale Superieure, which trained fu¬ 
ture secondary school teachers. There he learned about Abel’s recent death and Abel’s 
last published memoir, which contained a number of results which Galois himself had 
obtained and presented to the Academy. Cauchy was assigned to report on Galois’s 
work. Cauchy advised him to revise his work, taking into account Abel’s results. (It 
was for this reason that Cauchy did not present a report on Galois’s memoir.) Galois 
then wrote a new text and submitted it to the Academy in February 1830. Fourier 
was assigned to report on it, but Fourier died before reading it and the memoir was 
lost. 

In June 1830, Galois published a short note on the resolutions of numerical equa¬ 
tions and a much more important article, “Sur la theorie des nombres,” containing a 
remarkable theory of “Galois imaginaries.” 

On January 17, 1831, Galois presented to the Academy a new version of his memoir. 
Poisson reviewed it and declared much of it incomprehensible. 

It was a time of great political unrest in France. Galois joined the National Guard, 
a republican party. He was in and out of prison. Arrested during a republican demon¬ 
stration on July 14, 1931, he was placed in detention. There he revised his memoir on 
equations and worked on the application of his theory of elliptic functions. Later he 
was transferred to a nursing home because of a cholera epidemic. There he resumed 
his work and wrote several essays on the philosophy of mathematics. He also became 
involved in a love affair. He was challenged to a duel. Badly wounded, he died on May 
30, 1832. On May 29, the day before his death, he wrote a letter to his friend Auguste 
Chevalier, sketching his principal results. He scribbled comments on the margin of his 
documents such as, “I have no time,” and asking Jacobi and Gauss’s opinion “not as 
to the truth, but as to the importance of these theorems.” 

In 1843, Louiville prepared Galois’s manuscript for publication and announced to 
the Academy that Galois had solved the problem considered by Abel. The manuscript 
was finally published in the October-November 1846 issue of the Journal des ma- 
thematiques pures et appliquees. 

Galois’s terse style and the great originality of his ideas contributed to the delay 
in the publication of his papers. 






Chapter 24 


Galois Theory and 
Applications 

The approach used today to present Galois theory is due to Artin. Artin 
reformulated the theory as an abstract relationship between a field extension 
and its group of automorphisms. He succeeded in disassociating Galois theory 
from the solvability of algebraic equations. 

24.1 Normal Extensions 

Definition 24.1.1 Let F/K be an algebraic field extension. F/K is called a 
normal extension if every irreducible polynomial f(x) 6 K[x\ such that f(x) 
has a root in F , splits into linear factors in F[x\. 

An example of a normal extension, which comes quickly to mind is F/K , 
where F is an algebraic closure of K. A more trivial example of a normal 
extension is F/K , where F — K. The field extension Q(v^)/Q is not a normal 
extension since the minimal polynomial of ^2 over Q is x 3 — 2 has two complex 
roots and Q(v^2) does not contain these roots. (Example 24.2.7 to follow.) 

Let F/K be a field extension and F be a subset of the polynomial ring 
K[x\. Then F is called a splitting field for F if for all fix) G F, f(x) splits 
into linear factors over F and for all proper intermediate fields L of F/K , there 
exists fix) G F which does not split over L. If F consists of a single polynomial 
g(x) and F is a splitting field for F, we say that F is a splitting field for g{x). 

Lemma 24.1.2 Let F/K be a finite field extension and c G F. Then there 
exists a field L D F and a polynomial g[x) G K[x] such that the following 
properties hold. 

(i) L is a splitting field for g[x) over K. 

(ii) Every irreducible factor of gix) in K[x] has a, root in F. 

(Hi) c is a root of g(x). 
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Proof. Let {v\,V 2 ,..., v n } be a basis of F over K. Let g{x) be the product 
of the minimal polynomials of c,vi,V 2 ,... ,v n over K. Then property (ii) and 
(iii) hold. Let L be a splitting field of g(x) over F. Then L = -F(r*i,r 2 ,...,r m ), 
where r\, 7-2, .... r m are the roots of g(x). Since V\,V 2 ,... ,v n E {ri, r 2 ,..., r m } 
and F = K(vi,V 2 ,, v n ), L is a splitting field of g[x) over K. ■ 

Theorem 24.1.3 LetF/K be a finite field extension. Then the following con¬ 
ditions are equivalent. 

(i) F is normal over K. 

(ii) F is a splitting field over K for som,e polynomial g(x) E K[x\. 

(iii) For every field L D F, all K-isomorphisms from, F into L map F onto 
F, i.e., are K-automorphisms of F. 

Proof. Suppose that statement (i) holds. By Lemma 24.1.2, there is a 
polynomial g(x) E K[x] and a field L D F such that L is a splitting field for 
g(x) over K and every irreducible factor of g(x) has a root in F. Since F is 
normal over K, each of these irreducible factors of g(x) splits over K. Hence, 
gix) splits over K. Thus, F — L and so (ii) holds. 

Suppose that statement (ii) holds. Then F is a splitting field over K for 
some polynomial g(x) E K[x\. Let L be a field containing F and a be a K- 
isomorphism of F into L. Then 01 (F) is a splitting field for a(g(x)) = g(x) over 
a(K ) = K. Since g(x) has a unique splitting field over K and contained in L, 
cx(F) = F. Hence, (iii) holds. 

Suppose that statement (iii) holds. Let c E F and f(x) be the minimal 
polynomial of c over K. By Lemma 24.1.2, there is a field L D F and a poly¬ 
nomial g(x) E K[x\ such that L is a splitting field for g(x) over K and c is a 
root of g(x). Thus, f(x)\g(x) and so f(x) splits over L. For each root b of f(x) 
in L, there exists a if-isomorphism a of K(c) onto K(b) such that a(c) — b by 
Theorem 21.2.8. By Theorem 21.2.10, a can be extended to a K- automorphism 
a of L such that cr(c) — b. Since <7 maps F onto F by hypothesis, b E F. Hence, 
all the roots of f(x) in L lie in F. Since /( x) splits over L, it must split over 
F. Therefore, (iii) holds. ■ 

24.1.1 Worked-Out Exercises 

0 Exercise 1 Let FjK be a field extension. Suppose that [F : K] =■ 2. Show 
that F is a normal extension of K. 

Solution: Let a E F be such that a K. Since [F : K(a)] ■ [ K(a ) : K] — 
[F : K] = 2 and a £ K, [K(a] : K] = 2. Let p(x) be the minimal polynomial of 
a over K. Then [K(a) : K ] — deg p(x) — 2. Now p(x) — (x — a)h(x) for some 
h(x) E F[x]. Thus, degh(x) = 1. Suppose h(x) = cx + d for some c,d E F, 

c -=fi 0. Then — c~ l d E F and — c~ l d is a root o £h(x). Therefore, — c~ l d is a 

riLHjc. matkewtatl (xclL 
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root of p(x). Hence, both the roots of p{x) are in F. Thus, F is the splitting 
field of p(x) over K. Consequently, F is a normal extension of K. 

0 Exercise 2 Let F = Q(v // 2) and L = Q(\/2). Show that F is a normal 
extension of L, L is a normal extension of Q, but F is not a normal 
extension of Q. 

Solution: Now [F : L] = 2 = [L : Q]. Hence, F is a normal extension of 
L, L is a normal extension of Q by Worked-Out Exercise 1. Now x 4 - 2 E Q[ x\ 
is irreducible over Q and y/2 is a root of x 4 — 2. Thus, x 4 — 2 is the minimal 
polynomial of \[2. Now the roots of x 4 — 2 are ± \/2 and ±i \[2. Since ±z \/2 ^ F, 
F is not the splitting field of x 4 — 2. Therefore, F is not a normal extension of 

Q 

<> Exercise 3 Let K be a field of characteristic 0. Let F/K be a finite normal 
extension. Let g(x ) € K[x\ and E be a splitting field of g(x) over F. 
Then E/K is a normal extension. 

Solution: By Corollary 22.1.20, F = K{a) for some a 6 F. Let h{x) be the 
minimal polynomial of a over K. Now h(x) splits over F. Let f{x) — g(x)h{x). 
Then K C F C E and f(x) splits over E. Let L be the splitting field of f(x) 
over K in E. Then K C L C E. Now a 6 L and hence K C F C L. Thus, L is 
the splitting field of g{x) over F. Hence, E = L. Consequently, E/K is normal, 
by Theorem 24.1.3. 

24.1.2 Exercises 

1 . (i) Show that C is a normal extension of R. 

(ii) Is R a normal extension of Q? 

2. Let K C L C F be a chain of fields. Suppose that F/K is a normal 
extension. 

(i) Show that F/L is a normal extension. 

(ii) Is L/K a normal extension? Justify your answer. 

3. Let K C L\, L 2 C F be fields. Suppose that L\/K and L 2 /K are normal 
extensions. Show that {L\ nL 2 )/K is a normal extension. 

4. x Let F/K be an algebraic field extension. Let K be the algebraic closure 
of K such that F C K. Prove that the following are equivalent. 

(i) F/K is a normal extension. 

^his exercise requires Section 21.3. 
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(ii) If a : F —► K is a if-homomorphism, then a is an automorphism of 
F. 

(iii) F is the splitting field of a family of polynomials in K[x\. 


24.2 Galois Theory 

We have now reached the point where we can begin our study of Galois theory. 
Roughly speaking, this theory relates the roots of a polynomial to certain 
permutations of these roots. More specifically, if F is a splitting field for some 
polynomial f(x) over a field K such that F/K is separable, then this theory 
sets up a one-one inclusion reversing correspondence between the intermediate 
fields of F/K and the subgroups of a particular group of automorphisms of 
F/K. These results can be applied to the solution by “radicals” of the equation 
f{x) = 0. This application will be discussed in Section 24.4. 

Theorem 24.2.1 Let F be a field, and ai,...,ct n be distinct, a,utom,orphism.s 
of F. Then for all a € F and for all a \,..., a n £ F, 

aioq(a) -|-f- a n a n (a) = 0 


implies tha,t a\ = • • ■ = a n = 0. 

Proof. The proof is by induction on n. If n — 1 and aiai(a) = 0 for all 
a £ F, then a\ = 0 since aq(l) ^ 0. Assume the theorem is valid for any m, 
distinct automorphisms, where 1 < m < n. Suppose 

aiai(a) + • • • + a n a n (a ) = 0 for all a E F (24.1) 

and for some oi,..., a n E f, not all zero, say, ai 0. Since the automorphisms 
oq,..., a. n are distinct, there exists b £ F such that aq(6) <a n (6). Since Eq. 

(24.1) is valid for every element of F, we have a\a.i(ab) H— • + a n a n (a&) = 0 or 

aiori(a)o;i(6) + • • ■ + a n a n (a)a n (b ) = 0 for all a 6 F. (24.2) 

Multiplying Eq. (24.1) by a n (b ) and subtracting this result from Eq. (24.2), 
we obtain 

ai(ai(6) - a n (6))ai(a) -j-+ a n _ 1 (a n _ 1 (6) - a n (6))a n _i(a) = 0 

for all a £ F. Since a\(b) a n (b), ai(oi(6) — o n (6)) 7^ 0. However, this 
contradicts the induction hypothesis. Hence, the theorem is valid for all positive 
integers n. ■ 


“Pufuc- 7^o^Lc#fta£uiaZ 



24.2. GALOIS THEORY 


504 


Definition 24.2.2 Let G be a group of automorphisms of the field F. An ele¬ 
ment a E F is called fixed by G if o(a) = a for all aGG. We denote by Fq 
the set of all a € F such that a is fixed by G. 

Theorem 24.2.3 Let G be a group of a.utom.orphism,s of the field F. Then Fq 
is a subfield of F , called the fixed field of F for G. 

Proof. Note that Fq <p since 0,1 G Let a, 6 E Fq- Then for all oGG, 
a(a — b) — a(a) — o(6) — a — b so that a — b E Fq. If 6 ^ 0, then a(a& -1 ) = 
a(a)a(b~ l ) = a(a)o;(5)”' 1 — so that a6 _1 E Fq. Thus, Fq is a subfield of 
F. m 

Definition 24.2.4 Let F/K be a fi,eld extension. Let G(F/K) denote the set 
of all K-automorphisms of F. 

Theorem 24.2.5 Let F/K be a field extension. Then G(F/K) is a. subgroup 
of the group of all automorphisms of F and is called the group of automor¬ 
phisms of F relative to K. 

Proof. Clearly the identity map is in G(FjK) so that G(F/K ) <f>. Let 
a, (3 € G(F/K). Then for all k E K, (a o fi~ l )(k) — a(p~ 1 (k)) = a(k) — k. 
Thus, a o e G{F/K ) so that G{F/K ) is a group. ■ 

We ask the reader to verify that any automorphism of F fixes the prime 
subfield of F. 

Theorem 24.2.6 Let H be a finite set of automorphisms of the fi,eld F. Then 

(i) \H\ < [F : F h \ and 

(ii) if H is a group, then \H\ = [F : Fh]- 

Proof. (i) Suppose \H\ > [F : Fh]. Then [F : Fh] = n < oo for some n. Let 
b\, ..., b n be a basis for F/Fh- There exist n + 1 distinct automorphisms aj, 
..., a n +1 in G(FjF h)- Then the system of n homogeneous linear equations in 
the n 4- 1 unknowns x\, ..., x n+ 1 , 

(bi)-£\ ■ ’ ■ T 0 ^ 7 x 4 -1 (bi)xji-\-\ — 0, i — 1,2,..., n 

has a nontrivial solution x\ = ai, ..., x n+ \ = a n+ \ in F. Thus, 

{fii)a i 4" ■ ■ ■ 4“ cn 7l j r \ (bpa n -\-\ — 0, i — 1,2,..., n. (24.3) 

Now every element a E F has the form a — hbi, h E Fh and so 

n 

ajaj(a) = j — 1,2 ,..., n + 1. 

2-1 “Pu^nc. 
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Then using Eq. (24.3) and the fact that each a* fixes k\,... , k n we obtain 

a i a i( a ) H-ha rt +iQ n +i((i) = E?=i fc*(aiQ!i( 6 t) 4- + a n+ ia n+ i(bi)) 

= 0 

for all a £ F. However, this contradicts Theorem 24.2.1. Hence, \H\ < [F-.F h \. 

(ii) By (i), |i?| < [F : Fyy]. We now show that F| > [F : F/yj. Suppose 
\H\ [E ; Fh\- Set |if| — ti. Then there are elements b\, ... , 6 n+i of F which 
are linearly independent over Fh• There exists a nontrivial solution a \,..., a n +\ 
in F satisfying the system of n homogeneous linear equations in the n + 1 
unknowns x \,..., x n+ \, 


xioti(bi) 4-h x n+ iai(b n+ i) = 0, z=l,2,...,n, (24.4) 

where a; £ if. From all such nontrivial solutions of Eq. (24.4), choose one 
having the smallest number, say, m of nonzero members. We have m > 1 else 
aict:i(&i) = 0 and hence a\ — 0. (Note that a\(bi) ^ 0 since oq is one-one and 
&i 7 ^ 0.) Upon reordering we have 


aiai(bi)-[ -h a m a,(6 m ) = 0, i = l,2, ...,n (24.5) 

and no a* — 0. Let aq be the identity map. Then 

aq&i + ■ • • + a m 6 m = 0, 

where we take a m = 1. (If a m ^ 1 , then multiply through by a” 1 .) Since 
&l,..., b m are linearly independent over Fh, not all ai,..., a m are in Fjj, say, 
a\ £ Fh- Thus, for some otj, a j( a i) / a-i- Apply ctj to Eq. (24.5). Then 

Oij (ojoq ( 6 i )) T ■ ■ ■ “I - otj ( 0 , 77 ^ oq (bm )) — 0 , i — 1,2,..., n 


or 


Oij (o , l)&ij (^l) T ‘ ‘ T Oij ((lm)&ij(bm) — 0, % — 1,2, 


* 5 


n, 


(24.6) 


where a^- = Oij o o^. Since if is a group, {oqj,..., a n; } = fi. If we relabel the 
Eqs. in (24.6) and then subtract Eq. (24.6) from Eq. (24.5), we obtain 


(oi aj(cii))ai(b\) + • ■ ■ + (o m _i aj (^m—i))^i(frm— l) — 0 , i — 1,2, ... ,, ti. 


Since a\ — 0 ^( 01 ) ^ 0, ai — aj(ai), ..., o m _i — a m (a m _i), 0,..., 0 is a non¬ 
trivial solution of Eq. (24.4) having fewer than m nonzero members. This 
contradiction thus shows that the assumption [F : Fh] > \m is false. Hence, 

\H\ > [F : Fh] so that [F : Fh] = \H\. ^ ^ Tlt a ±U JCLmja ±t CJ al 
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Example 24.2.7 Consider the field extension Q(\/2)/Q. Let a be any auto¬ 
morphism o/Q(v / 2). Then a fixes every element of Q. We ha.ve 

(a(\/2)) 3 = a((v / 2) 3 ) = a(2) = 2. 

Hence, a(y/', 2) is a root of x 3 — 2. Thus, a(\/ 2) = f[2 beca,use the other two cube 
roots of x 3 — 2 a,re complex numbers, nam.ely, v / 2(— and f/2 
and so are not members of Q(\/2). Hence, a is the identity map on Q(v // 2). 
Thus, G(Q(v // 2)/Q) = {e}. But Q(\/2) g(q( = Q(v^) D Q. We note 

that Q(\/2) not the splitting field of the polynomial x 3 — 2 over Q. iVott; 
1 = |G(Q(^)/Q)| < [Q(^2) : Q]- If the other two roots of x 3 — 2 were 
present, then we would have found an a such th,a.t a(\/2) ^ y/2. 

Example 24.2.8 Let F/K be any field extension of characteristic p > 0 such 
that there exists a G F, a ^ K, and a is purely inseparable over K. Let a be 
any a,utom,orph,ism of F, which fixes every elem.ent of K. Let e be a positive 
integer such that a p& ~ k G K. Then {cx(a) — a) p£ = a(a) p£ — a pe — a(a p£ ) — a pS 
= a(k) — k — k — k = 0. Since a field has no nonzero nilpotent elements, 
a (a) — a = 0 or a (a) = a. Hence, -Fg(F//C) 5 K( a ) K. Here we note 
that because of the presence of a, F/K would not be separable even if it were 
algebraic. 

For a field extension F/K , we will want F g ^ F / F ) — K. The above two 
examples point out difficulties we must overcome. 

Theorem 24.2.9 Let H be a finite group of automorphism,s on the field F. 
Then H = G(F/F H ). 


Proof. Clearly H C G{F/Fh)- By an argument similar to that of the 
first part of the proof of Theorem 24.2.6, G(F/F f ) is a finite group. Hence, 
by Theorem 24.2.6, \G{F/F H )\ = [F : F g ^ f/Fh) }. Now F h = F G ( F/Fh) since 
Ffj I> F G ( Fj / Fh } and if a G F f , then for all a G G(F/F f ), ox {a) = a so that 
a G F G ^ F /p H y Therefore, \H\ = [F : Fh) = [F : F G ( F / Fh )] = \G(F/Fh)\. Since 
H C G(F/Fh) and G(F/F f ) is finite, we have H = G(F/F f ). ■ 


Let us pause to see what we have so far. Let F/K be a finite field exten¬ 
sion. We desire a one-one inclusion reversing correspondence between all the 
intermediate fields of F/K and all the subgroups of G(F/K). From Examples 
24.2.7 and 24.2.8, we have seen that it is possible for an intermediate field L 
of F/K to be strictly contained in Fg(F/L)- Hence, a mapping 


L->G. 


m 


itfie Tftai/tewtaitea/ 


847) 
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need not be one-One since 

Fg{f/l) G{F/F G ( F j L ')) = G(F/L), 

but L C Fq^f/l} is possible. Note that the mapping 

Fh H (24.8) 

is one-one since by Theorem 24.2.9, H — G(F/F f )- The mapping of (24.8) is 
the “inverse” of (24.7), but the mapping in (24.8) does not map onto all the 
intermediate fields of F/K. We can thus see that we need some sort of condition 
on F/K to force every L = F G ( F / L y Examples 24.2.7 and 24.2.8 suggest the 
condition should be that F/K be separable and be the splitting field of some 
polynomial over K. A similar difficulty is not encountered with G(F/K) since 
H = G(F/F f ) by Theorem 24.2.9. 

Definition 24.2.10 Let F/K be a finite fi,eld extension. If Fq( F / F ) — AT, 
then G(F/K) is called the Galois group of F/K and F/K is called a Galois 
extension. 

Theorem 24.2.11 Let F/K be a finite extension. The following conditions 
a,re equivalent. 

(i) G(F/K) is the Galois group of F/K. 

(ii) F/K is normal and separable. 

(Hi) F is the splitting field of a separable polynomial in K[x\. 

Proof. Suppose [F : K] = n. Let H be a subgroup of G{F/K). Then by 
Theorem 24.2.6, \H\ = [F : F H ] < [F : K] = n. 

(i)=^(ii) Suppose G(F/K ) is the Galois group of F/K. Then 

\G{F/K)\ = [F : K\ = n. 

Since F/K is finite, F/K is an algebraic extension and F — K{u\, U 2 , ..., u n ) 
for some Ui G F, 1 < i < n. Let G(F/K) = {e = ai,a 2 ,.. ./an}- Let a G F 
and a — a\, a 2 , .. ., a m be distinct elements of the set {o^(a) | i = 1 ,,n}. 
Now oLj oq, G G{F/K) for all i and j. Let a z = c^(a), i = 1,2,... ,n. Then 
ctj(ai) = aj(ai(a )) = aj o a;(a) = a r (a ) = a T for some r, 1 < r < m. Since 
a*, is air automorphism of F, OL^af) = otkiafi) if and only if = aj. Thus, 
for all k, 1 < k < n , ak(a\), 0 ^( 02 ), - a)j(a m ) are distinct elements. Let 
f a (x) = (x—ai)(x—a 2 ) • • • (x—a m ). Then all roots of f a (x) are distinct and lie in 
F. Also, the factors of f a (x) are merely permuted by any a % of G(F/K). Thus, 
the coefficients of f a (x ) remain unaltered by any oci G G(F/K). Therefore, 
f a (x) G K[x] since K — F G ( F /k). Hence, a = ai is a root of a separable 
polynomial f a (x) in K[x\ and f a (x) splits over F. From this, it also follows 
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that for all i, 1 < i < n, iq is a root of a separable polynomial f Ui ( x ) in 
K[x] and f Ui ( x ) splits over F. Thus, all the roots of the polynomial f(x) = 
f Ul {x)f U2 (x) ■ • ■ f Un ( x ) £ K[x\ are in F. Since F = K(u\, U 2 , ..., u n ) and each 
Ui is a root of f(x), F is the splitting field of f(x) and so F/K is normal. 
Since each Ui is a root of a separable polynomial over K , it follows that F/K 
is separable. Consequently, F/K is normal and separable. 

(ii) =^>(iii) Since F/K is a finite separable extension, there exists a £ F such 
that F = K (a). Now a is a root of a separable irreducible polynomial f(x) £ 
K[x\. Since F/K is normal, f(x) splits over F. Thus, F contains all roots of 
f(x). Hence, F is the splitting field of a separable polynomial f(x) £ K[x\. 

(iii) =>(i) Suppose F is a splitting field of a separable polynomial f(x) £ 
K[x). Let 777, be the number of distinct roots of f(x) in F , but not in K. We 
prove the result by induction on m. If m = 0, then F = K and G(F/K) — {e}, 
where e is the identity automorphism of F. Hence, K = F = iv?(F//C)- Assume 
that the result holds for all field extensions S/T such that S' is a splitting field 
of a separable polynomial g(x) £ T[x\ with g(x) having fewer than m> 1 roots 
outside of T. 

Let f(x) — pi (x) ■ • • Pk( x ), where each Pi(x) is irreducible and separable 
in K[x\. Since m > 1, deg Pi(x) > 1 for some i. By renumbering if necessary, 
we may assume that i = 1, i.e., degpi(a:) = t > 1. Let a be a root of pi(x). 
Then \K(a) \ K\ — t. Since p\{x) is irreducible and separable, its roots a = 
ai, <i 2 , - - ■, at are all distinct. Thus, there exist isomorphisms o^, c^,..., a' t such 
that a[ : K(a) —» K(di) with ct'^a) = a{ and the elements of K are fixed by a^. 
Since F is a splitting field of f(x) over both K(a ) and K(a,i), the isomorphism 
a' can be extended to an automorphism of F, which maps a onto a z and 
fixes the elements of K, i = 1,2,..., t. 

Suppose now that c £ ^(F/F)- Since f(x) has fewer than m, roots outside 
K(a), K(a) = Fg(f/K{o)) by our induction hypothesis. Since G(F/K(a)) C 
G(F/K), c e As(F/F(a)) = K (a). Hence, 

c — ko T k\d T • • • ki—\o/ ^, k{ £ AT, i = 0,1,..., t — 1. 

Thus, 

Oii(c) — c — /c 0 + k\ai H-h kt-io/f 1 , i = 1 

Therefore, 

g{x) — (ko - c) + k\x H-f k t ~\x l ~ l 

has t distinct roots ai, <22, ..., at in F. Since deg^(x) < t, g(x) must be the zero 
polynomial. Hence, ko — c = 0 or c = ko £ K. Consequently, K = A(j(F/Fr)* ® 

Corollary 24.2.12 Let F/K be a fi,nite extension. The following conditions 
are equivalent. 

(i) \G(F/K)\ = [F : K\. 

(a) F/K is normal and separable. 


~Pu*uc- 
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Proof. Write G = G(F/K). 

(i) =»(ii)Now K C F G C F and [F : K\ = [F : Fq\[Fq : K]. Also, [F : K] = 
\G(F/K)\ = [F : Fg]. Hence, [Fq : K] = 1 and so Fgr = K. Thus, F/K is 
normal and separable by Theorem 24.2.11. 

(ii) =>(i) Since F/K is normal and separable by Theorem 24.2.11, K — 
F g(f/k) . Hence, \G(F/K)\ = [F : F G \ = [F : K}. ■ 


We are now ready to present the one-one inclusion reversing correspondence 
between the intermediate fields of a Galois extension and the subgroups of its 
Galois group. 

Theorem 24.2.13 (The Fundamental Theorem of Galois Theory) Let 

F/K be a finite normal and separable field extension. Let G — G(F/K), 
F = {L j L is an interm.ed.iate field of F/K}, and S(G) be the set of all 
subgroups of G. Then the following properties hold. 

(i) K = F a . 

(ii) The m,apping 4 r : F —+ S(G) defined, by 4 r (L) = G(F/L) for all L G T is 
a one-one correspondence. The m.apping $ : S(G) —» F defined by 4>(if) = Fh 
for all H G S{G) is the inverse of 4 r . Also, for all L G F, [F : L\ = \G(F/L)\ 
and [L : K] — [G : G(F/L)\. 

(Hi) Let L, V G F. Then V C L if and only if G(F/V) D G(F/L). In this 
case, [L : L'} = [G{F/L') : G(F/L)\. 

(iv) Let L, L' G F. Let 'L(L) = H and L') = H'. Then there exists oGG 

such that ot(L) = L' if and only if aHa~ l = H'. 

(v) Let L G F. Then L/K is a, norm,al extension if and only if G(F/L) is 
a norm,a,l subgroup of G. In this case, 

G(L/K) ~ G{F/K)/G{F/L). 


Proof. (i) Immediate from Theorem 24.2.11. 

(ii) Clearly 4/ is well defined. By Theorem 24.2.9, the mapping 4' is onto. 
Suppose G(F/L) = G(F/L'). Then F G (f/l) ~ Fg(F/L')- Since F/K is finite, 
normal, and separable, so is F/L for every intermediate field L of F/K. By 
(i), we have L = F G ( F /q = F G {f/L') = L'. Hence, the mapping 4^ is one-one. 
From Theorems 24.2.9 and 24.2.11, it follows that 4> is the inverse of T. By 
Theorem 24.2.6, [F : L] = \G(F/L)\. That [L : K] = [G : G(F/L)\ follows 
easily by Lagrange’s theorem and Theorem 21.1.20. 

(iii) Clearly L D V if and only if G(F/L f ) D G(F/L). That [L : L'\ - 
[G{F/L') : G{F/L)\ follows by (ii) since F/L' is normal. (Since 4> is one-one 
and onto S(G), we have L D L' if and only if G{F/L ) C G(F/L').) 

(iv) Suppose <a(X) = L'. For any a' gL, we have ot{a) = a' for some aGl. 
Now for all (3 G H, /3(a) = a. Therefore, Q:(/3(a _1 (a / ))) = a(/3(a)) = a(a) = a'. 
Thus, ao f3 o or 1 G H' so that aHa~ l C H'. Now \H'\ = [F : L'} = [F : L\ = 
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\H\ = \aHa~ l \. Hence, aHa ~ 1 = H'. Conversely, suppose aHa~ l = H'. 
Then for all a £ L and for all {3 £ H, a(P(<y~ 1 (a(a)))) — a(/3(a)) — a(a). 
Thus, a(L) C Fh' = 1/ Now |iF| = |F| . Therefore, [F : L] = [F : L r ], whence 
[a(L) : K] = [L : K\ = [V : K\. Consequently, a{L) = L'. 

(v) Since F/K is separable, L/K is separable and so by Corollary 24.2.12, 
we have L/K normal if and only if \G(L/K)\ = [L : K\. We now show that 
\G(L/K)\ = [L : K] if and only if every isomorphism of L leaving K fixed is 
an automorphism of L/K. 

For any a £ G, a determines an isomorphism of L leaving K fixed. On 
the other hand, if /3 is an isomorphism of L leaving K fixed, then since L/K 
is normal, /3 can be extended to an automorphism of F leaving K fixed by 
Theorem 21.2.10. 

Write H — G(F/L) and set m = [G : H], Now by (ii), m = [L : K}. Let 
H = a.\H , a. 2 H, ..., a m H be the distinct cosets of H in G. For a £ L and 
(3 £ F, (ai o 0){a) = c^(a) for each i since L = Fh • Thus, the elements of G 
in the same coset of H determine the same isomorphism of L. Conversely, if 
a(a) — <y'(a) for all a G L, then a = (a -1 o a')(a) or a -1 o a' E if so that 
a, ol determine the same coset of H in G. Therefore, the number of distinct 
isomorphisms of L fixing the elements of K is m — [G : H]. If \G{L/K) \ = m, 
then every isomorphism of L fixing the elements of K must be an automorphism 
of L/K since every automorphism of L is an isomorphism of L. Conversely, if 
every isomorphism of L fixing the elements of K is an automorphism of L, then 
\G(L/K)\ is the number m of these isomorphisms. Hence, \G(L/K)\ = [L : K] 
if and only if every isomorphism of L leaving the elements of K fixed is an 
automorphism of L/K, oi L/K is normal if and only if every isomorphism of L 
leaving the elements of K fixed is an automorphism of L leaving the elements 
of K fixed. 

Now, every isomorphism of L leaving the elements of K fixed is an auto¬ 
morphism if and only if a(L) = L for all a E G. By (iv), a(L) = L for all 
a EE G if and only if H = aHa~ l , i.e., if and only if H is normal in G. 

If L/K is normal, then the distinct automorphisms of L fixing the elements 
of K correspond uniquely to the cosets of H in G. This one-one correspondence 
is clearly an isomorphism of G(L/K) and G/H since for a, a' £ G, we have 
that a o a' corresponds to ( aH)(a'H) = a o a'H. ■ 

Let F/K be a finite normal separable field extension and L be an interme¬ 
diate field of F/K. We have seen that F/L is a normal extension, but L/K is 
not necessarily normal. The above result tells us when L/K is normal. 


Example 24.2.14 Let S be the split ting field of the irreducible* polyn 

HilHjc. mo^Lc#fta£uial 
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x 3 — 2 over Q such that SC C. Now 

x 3 — 2 - (x-^)(x 2 + ^2x+^4) 

= (x - %2){x - -^(-1 + V^))(x - -^(-1 - v ^)). 

Thus, S = Q(^2, 4^(-l + VSi), 4^(-l - >/3i)) - Q(^2, \/3i) AW S/Q 
is normal and since (x 2 + \/2x + v^4) is irreducible over Q(v^), [s : Q] = 6. 
Hence, |G(S/Q)| = 6. 

The automorphisms of G(S/Q) are completely determined by where they 
map y/2 and, y/Si. The following table defines the group G(S/ Q). Set r 2 = 

^(-1 + VSi)a,nd r 3 = 4^(-l - VSi). 



e 

a 

P 

a (3 

pa 

a(3 a 

y/2 

y/2 

y/2 

T2 

r 3 

r2 

T3 

T2 

T2 

rs 

y/2 

y/2 

r 3 

r 2 

?3 

rs 

T2 

rs 

r 2 

yj2 

y/2 



e 

a 

P 

a (3 

(3a 

a(3a 

y/2 

V2 

V2 

r 2 

r 3 

T2 

r 3 

VS i 

y/S i 

-VS i 

-VS i 

VS i 

VS i 

-VS i 


The subgroups ofG(S/Q ) are 


-ffi = {e,o;},i?2 = {e,/?},i?3 = {e, a:/3a}, iL 4 = {e,a(3,pa}. 

The corresponding intermediate fields are 

h = Q {\/2),L2 = Q (r 3 ),L 3 = Q(r 2 ),L 4 = Q(V^i). 

By Example 4.3.2, iS*, i — 1, 2,3, is not norm,al in G(S/Q) so Li /Q is not 
normal, i — 1 , 2,3. TVouj if 4 is normal in G(S/ Q) and so L^/K is norm.al. 

Let S' be a splitting field over the field K for a polynomial /(x) in K[x\. 

Then we call G(S/K) the Galois group of the equation /(x) = 0 or the 
Galois group of the polynomial /(x). For any a 6 G(S/K) and for any 
root a of /(x) in S, 0 = a(f(a)) = f(a(a)). Thus, a(a) is a root of /(x) in 
S. Since a is a AT-automorphism of S, distinct roots of /(x) map onto distinct 
roots. Hence, a acts like a permutation on the roots of G(S/K). Let 7r a denote 
the permutation of the distinct roots of /(x) induced by a. Then the mapping 
a — > 7t q is an isomorphism of G(S/K ) into S n , where f(x) has n distinct roots. 
Example 24.2.14 is one, where G{S/K) ~ S3. 

Let if be a field of characteristic 7^ 3. Consider a cubic polynomial /(x) = 

x 3 + ax 2 + bx + c, where a, b, c £ iC_We eliminate the quadratic tercm by 

SriLtuc. WafAemafLca/ 7 'UfyAucS. 
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substituting u — | for x. Then 

g{u) = (u- f ) 3 + a(u- §) 3 2 + b(u- f) + c 

= u 3 — au 2 + y u — + cm 2 - |a 2 u + y+fru—y+c 

= + +c. 

Hence, r is a root of g(u) if and only if r — | is a root of /(x). 

Now let /(x) = x 3 + 6 x + c € if [x]. Then /(x) is irreducible over if if and 
only if f(x) has no roots in K. Over a splitting field S of /(x) over K , we have 

/(x) = (x - ai)(x - a 2 )(x - a 3 ), 

where 01 , 02,03 E 5. Thus, 


ai + 02 + 03 — 0, 

0102 + 0103 + 0203 = 6, 

Define the discriminant D of /(x) as follows: 

D = [(02 — oi)(o 3 — ai)(a 3 — 02)] 2 . 

Let d = (o 2 — ai)(a3 — ai)(o3 — 02). Then any i-T-automorphism a of S = 
^(01,02,03) leaves D fixed, i.e., a(D) = D since a(d) is either d or ~d. An 
easy calculation shows that 

D = —4b 3 - 27c 2 . 

Theorem 24.2.15 Let /(x) = x 3 + bx + c be an irreducible and separable 
polynomial over the field K. Let S be a splitting field of f{x) over K and G be 
the Galois group of f{x) over K. Then G cz S 3 if and only if D is not a, square 
in K. If D is a square in K, then [S : K) = 3. 


Proof. By the above discussion, D G K. Suppose d E K. Then a(d) — d 
for all a G G. Thus, no a can be an odd permutation. Hence, each a is 
in the alternating group A 3 . Conversely, if a G A 3 , then a(d) — d. Since 
f{x) is separable and irreducible, the roots of /(x) are distinct. Therefore, 
G 7 ^ {e}. Thus, the above argument shows that G = A 3 if and only if d G K. 
Consequently, G = S 3 if and only if d £ K. If d E K, then G = A 3 and |G| = 3 
and so [S : K] — 3 by the fundamental theorem of Galois theory. ■ 


Theorem 24.2.16 Let /(x) = x 3 + bx + c be an irreducible and separable 
polynomial over the fi,eld K. Let S be the splitting field of /(x) over K. Then 
S = K{\/D,r) for any root r of /(x). 

JPuJmjc. 
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Proof. Now [K(r) : K] = 3. If S = K{r), then S = K{\/D,r). Suppose 
S 3 K{r). Then [S : K] = 6 and [5 : K(r )j = 2. Since [S : K] = 6 , G = S 3 , 
where G is the Galois group of /(x) over K and so d <£ K. Since d is a root of 
x 2 — D over K, x 2 — D is irreducible over K. Since 2 and 3 are relatively prime, 
x 2 — D is irreducible over Kir). Thus, S — K{\fD,r). ■ 

Example 24.2.17 Consider the polynomial x 3 — 4x + 2 G Q. Then x 3 — 4x + 
2 is irreducible over Q by Eisenstein’s criterion. Now D = —4b 3 — 27 c 2 = 
—4(—4) 3 — 27(2) 2 = 148. Thus, D is not a square in Q. Hence, the Galois 
group of x 3 — 4x + 2 over Q is isomorphic to S 3 . S — Q(\/l48, r), where r is 
any root of x 3 — 4x + 2. 


24.2.1 Worked-Out Exercises 


0 Exercise 1 Let f{x) = x n — 1 € Q[x], Show that the Galois group of /(x) 
over Q is commutative. 

Solution: Let £ — e ~^, where i 2 = —1. Then the roots of f(x) are 1, £, 
£ 2 , ..., £ n_1 . Clearly K = Q(£) is a splitting field of /(x). Let a, (3 E G(K/Cl). 
Now o;(£) and /?(£) are roots of /(x). Hence, a(£) = £ fc and /!(£) = £ J for some 
k, j\ 1 < k, j < n — 1. Now (a o £?)(£) = £ fc l — (/3 o a)(£). Let y E K. Then y — 
T.U a £ l for some a i £ l < 1 < n - Now (a 0 p)(y) = (a ° WEiEo 1 = 
£?=0 )( ao P)( a £ l ) = EK, 1 a i{ ao 0)(Z l ) = ES) 1 a/£ M - Similarly, (/3 oq)(j/) = 
ErEo 1 Therefore, o; o j3 = (3 o a. Consequently, G{Kj Q) is commutative. 

0 Exercise 2 (i) Find a primitive element for the extension Q(a/ 2, 73) of Q. 

(ii) Find [<2(72, \/3):Q]. 

(iii) Show that Q (72, 73) is a splitting field of some polynomial fix) 
over Q. 

(iv) Prove that Q(\/2, a/3) is a normal extension of Q. 

(v) If F = Q( a/2, a/3), find the group G(F/ Q). 


Solution: (i) u = a/2+a/3 E Q(a/2, a/3). Thus, Q(a/2+a/3) Q Q(\/2, 73). 
Now 72+a/3 E Q(72+a/3). Therefore, € Q(a/2+a/3) and so a/2— 73 6 

Q(a/2 4- a/3)- Since a/2 = ^(2a/2) = ^((a/2 + 73) + (72 — 73)), it follows 
that 72 E Q(a/2 + a/3). Again a/3 = ^((a/2 + 73) — (a/2 — a/3)) shows that 
a/3 E Q(a/2 + 73). Thus, Q(a/2, 73) C Q(72 + 73). Hence, Q(72 + 73) = 
Q(72,73). 

(ii) [Q(72,73) : Q] - [Q(72,73) : Q(72)][Q(72 ) : Q]- Now x 2 — 2 is the 

minimal polynomial of Q(72 ) over Q. Also, x 2 — 3 is the minimal polynomial 
of Q(72 ,73) over Q(a/2) by Example 21.1.21. Hence, [Q(a/2, a/3) : Q] = 
2-2 = 4. 

Pit^Le. 7feoiAe»taftca/ 
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(iii) Let /(x) — ( x 2 — 2)(x 2 — 3) = x 4 — 5x 2 + 6 £ Q[x]. Since f(x) — (x + 
y/2)(x — \/2) (x + \/3) (x — \/3), /(x) splits over Q(\/2, \/3). Thus, Q(\/2, \/3) 
is a splitting field of /(x) over Q. 

(iv) /(x) = (x 2 — 2)(x 2 — 3) is a separable polynomial over Q. Since 
Q(\/2, \/3) is the splitting field of /(x) by Theorem 24.2.11, it follows that 
Q(\/2, \/3) is a normal extension of Q. 

(v) By the fundamental theorem of Galois theory 24.2.13(i), we find that 
\G(F/Q)\ = [F : Q] = 4. Now we know that Z 4 (the cyclic group of order 4) 
and Z 2 x Z 2 (the Klein 4-group) are the only (up to isomorphism) groups of 
order 4. Hence, either G{F/ Q) ~ Z 4 or G{F/ Q) ~ Z 2 x Z 2 . If G(F/ Q) ~ Z 4 , 
then G{F/Q) has only one subgroup of order 2. Thus, by the fundamental 
theorem of Galois theory, there exists only one intermediate field L of F/Q 
such that [L : Q] = 2 . But Q(V2) and Q(\/3) are intermediate fields of F/Q 
such that [Q(\/2) : Q] = 2 and [Q(\/3) : Q] = 2. Hence, G{F/ Q) ^ Z 4 . 
Consequently, GiF/ Q) ~ Z 2 x Z 2 . 

0 Exercise 3 Let u be a complex number such that 1 and u is a root of 
the polynomial x 5 - 1 <E Q[x). Show that G(Q(u)/Q) ~ Z 4 . 

Solution: x 5 — 1 = (x — l)(x 4 + x 3 + x 2 + x + 1 ). Hence, u is a root of 
fix) = x 4 + x 3 + x 2 + x 4 - 1 . By Worked-Out Exercise 1 (page 379), we find 
that fix) is irreducible in Q[x]. From Theorem 24.3.3, Q (u) is a splitting field 
of f(x). Since all roots of f(x) are distinct, f(x) is a separable polynomial. 
Hence, Q(u) is a normal extension of Q. By Corollary 24.2.12, 

\G(Q(u)/Q)\ = [Q(u):Ql = 4. 

Now u,u 2 ,u 3 ,u 4 are the four distinct roots of f(x) and Q(tt) = Q( u 2 ) — 
Q(« 3 ) = Q( w 4 ). Hence, there exists a e G(Q(u)/Q) such that <t(u) — u 2 . 
Thus, 

cr 2 (u) = a(cr(u)) = cr{u 2 ) = a(u)<j(u) — u 4 , 

<J 3 (u) = cr(cr 2 (u)) = cr(u 4 ) = u 8 = V 3 , 

and 

<J A {u) =■ <7 (u 3 ) = U 6 = U. 

So we find that a, <r 2 , cr 3 , and cr 4 are distinct and cr, <r 2 , cr 3 , 6 G(Q(u)/Q). 
Therefore, G(Q(u)/Q) is a cyclic group of order 4. Consequently, C(Q(u,)/Q) ~ 

Z 4 . 

<0 Exercise 4 Show that the Galois group of the polynomial f(x) = x 3 — 5 
over Q is isomorphic to S 3 . 

riLfixi. 
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Solution: Let u > = —. Then uA = Then u = s/5, un; and 

mu 2 are the three distinct roots of /(x). Thus, the splitting field of /(x) over 
Q is Q (u,uw,uw 2 ) — Q (u,w). Q is of characteristic 0. Hence, Q(rt,rr;) is a 
normal extension of Q. Therefore, 

|<2(QO,w)/Q)| = [Q(^,w ) : Q]. 


Now 

[QOb™) : Q] = [Q(u,u;) : Q(u)][Q(u) : Q]. 

The minimal polynomial of to over Q(-u) is x 2 +x-|-l and the minimal polynomial 
of u over Q is x 3 — 5. Consequently, [Q(k,u> ) : Q] = 2 • 3 = 6. Thus, we find 
that G(Q(rt,cj)/Q) is a group of order 6 which is (up to isomorphism) either 
Zq or S 3 . If G(Q(u,u>)/Q) ~ Zq, then G(Q(u, cv)/Q) has only one subgroup 
of order 2 , i.e., 6?(Q(ia, td)/Q) has only one subgroup of index 3. But Q (u,w) 
contains three distinct subfields Q( u), Q(izw), Q( uu 2 ), 

[QM : Q] = [Q(«w) : Q] = [Q(w^ 2 ) : Q] — 3. 

Hence, G(Q(u,u;)/Q) 9 ^ Zq. Consequently, G(Q(u,cj)/Q) ~ S 3 . 

0 Exercise 5 Let p be a prime integer and m be a positive integer. Find the 
Galois group of the polynomial f(x ) = x pm — x over Z p . 

Solution: The roots of f(x) over Z p form the Galois field, say, F with 
p m elements. Now [F : Z p \ = m and F is the splitting field of x pTn — x over 
Z p (Theorem 23.1.2). By Theorem 22.1.12, Z p is perfect. Thus, F/Z p is a 
separable extension. Also, F is a normal extension of Z p . Hence, by Corollary 
24.2.12, we find that \G(F/Z p )\ = m. Define a : F — > F by a(a ) — a p . Let a ,6 
be two distinct elements of F. Then cr(a) — a (b) = a p — bP = (a — b) p ^ 0. 
Thus, cr is one-one. Also, F consists of a finite number of elements. Hence, a 
is also onto F. Now 

a (a + b) = (a + b) p — a p + If — a (a) + a(b) 


and 


a(ab) = ( ab) p ~ oPb 9 a{a)<j(b) 

for all a, 6 € F. Therefore, cr is an automorphism of F. If a E Z p , then a p = a 
and hence cr(a) = a. Thus, it follows that a £ G(F/Z P ). For any positive 
integer A;, a k £ G(F/Z p ) and a k (a) = a p for all a £ F. Since every element of 
F is a root of x pm — x, a m (a ) = a pm = a for all a £ F. Hence, a m is the identity 
element of G{F/Z p ). Suppose for some r, 1 < r < m, a r = e. Then a p = a for 
all a £ F. Thus, every element of F is a root of x pT — x over Z p . Since x p — x 
has at most p T roots, \F\ <p T < a contradiction. Consequently, o(cr) = m 
and so G = (cr) . 

riLfixi. TfeoiAewtaitca/ 
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<0 Exercise 6 Find the Galois group of the polynomial x 4 — 2 over Q. 

Solution: From Eisenstein’s criterion, it follows that x 4 — 2 is irreducible 
over Q. Now u = \/2 is a root of x 4 — 2 . Also, 

x 4 — 2 = (x — v // 2 )(x + </2)(x + i\/ 2 )(x — i\/ 2 ). 

Hence, the splitting field of x 4 —2 is Q(v^2, —\^2, i\/2, —i\f\ 2) = Q(v / 2, 2 \/2) = 
Q(v / 2, i) = Q(u, z). Now 

[QCM) : Q] = [Q( 22 , 2 ) : Q(u)][Q(u) : Q]. 

The minimal polynomial of u over Q is x 4 — 2 and the minimal polynomial of 
i over Q (u) is x 2 + 1. Thus, 

[Q(u,z) ; Q] = 2 - 4 = 8 . 

Also, Q(u, i) is the splitting field of the separable polynomial x 4 — 2. Hence, 
Q(u,i) is a normal extension of Q. Therefore, by the fundamental theorem of 
Galois theory, it follows that |G(Q(u, z)/Q)| = 8 . Now {1, </2, (V2) 2 , (^2) 3 , 
i, i\J 2, i(v^2) 2 , i(v^2) 3 } is a basis of (Q(ti, i) over Q. Let a € Q (u,i). Then 
there exist ao, ai, 0,21 a 3 , a 4 , 05 , a 5 , and 07 in Q such that 

cl = clq + d\ y/2 + < 22 ("V^ 2)2 T a 3 (^) 3 + U 42 + + clqi^ \/2 ) 2 + a 7 i(“\/ 2 ) 3 * 

If o E G(Q(u,z)/Q), then 

o(a) = do T a\a{\/2) + a2Cx(\/2)^ + a 3 o(v // 2 ) 3 + 040 ( 2 ) + 

05 0 ( 2)0 (^ 2 ) + < 26 o( 2 )a( v ^) 2 + a 7 a( 2 )o(v /, 2 ) 3 . 


Thus, a (a) will be known if we determine o(v / 2 ) and 0(2). Since the minimal 
polynomial of \/2 is x 4 — 2 E Q[x] and the minimal polynomial of i is x 2 + 1 E 
Q[x], o(^ 2 ) is a root of x 4 — 2 and 0(2) is a root of x 2 + 1 . Hence, a(v |/ 2 ) is 
one of ^ 2 , — v^ 2 , z-^ 2 , — 2^/2 and 0(2) is one of 2 and —i. It now follows that 
G(Q(u, i)/ Q) has eight elements. The eight elements of G(Q(n, z)/Q) are 
given by the following table 



Now 


(02 o og)(w) = 02(222) = 02(2)02(22) = 2(222) = —22 


(06 o O2) (22) = 05(222) = <26^2)05(22) = —2(222) = 22. 


U-Hxl. 
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Consequently, cx2 oa 6 / aQoo? 2 . Therefore, we find that G(Q(u,i)/Q) is anon- 
commutative group of order 8 . Hence, G(Q(ti, i)/Q) ~ D 4 or G(Q(u, i)/Q) ~Q 8 
Now Qg has only one subgroup of index 4, but there are more than one inter¬ 
mediate field of Q(M)/Q of dimension 4 over Q, namely, Q(-u) and Q (iu). 
Thus, G(Q(u,i)/Q ~ D 4 . 

<> Exercise 7 Find all proper subfields of Q(y / 2, \/3). 

Solution: Let F — Q(\/2, VS). Then from Worked-Out Exercise 2 (page 
513), G(F/ Q) ~ Z 2 x Z 2 . Now Z 2 x Z 2 has only three nontrivial subgroups. 
Each of these subgroups is of index 2 . Since Q(\/ 2 ), Q(\/3), Q(\/ 6 ) are inter¬ 
mediate fields of Q(V% \/3)/Q and [Q(\/2) : Q] — [Q(\/3) : Q] = [Q(\/ 6 ) : 
Q] = 2 , it follows that Q(\/ 2 ), Q(v / 3), and Q(\/ 6 ) are the only intermediate 
fields of Q(\/2, \/3)/Q. Again Q is a subfield of F and Q has no proper sub¬ 
fields. Hence, Q, Q(\/ 2 ), Q(\/3), and Q(\/ 6 ) are the only proper subfields of 
F. 


0 Exercise 8 Find the Galois group of the field extension 

QC^_ -i+^/q 

Find all subgroups of this group and find all corresponding intermediate 
fields in the above extension according to the fundamental theorem of 
Galois theory. 


Solution: Let F — Q(v^5, u), where to = ~ 1 + i v / 3 | p rom Worked-Out 
Exercise 4 (page 514), we find that G(Fj Q) ~ S 3 . S 3 has four nontrivial 
subgroups H\ — {e, (1 2)}, H 2 = {e, (1 3)}, H 3 = {e, (2 3)}, and H 4 = {e, (1 
2 3), (1 3 2)}. The index of H 4 is 2. Hence, the corresponding subfield of H 4 is 

QH- 

Again [S3 : H\\ = [S3 : H 2 ] = [S3 : H3] = 3 and [Q(it) : Q] = [Q(uoi) : Q] = 
[Q {uujS) : Q] = 3, where u = ^5. Let a 4 = u, (12 = uw , and a 3 = uw 2 and 
1 ai, 2 ci 2 ) and 3 ^ a 3 . Now 

(1 2) a\ —> a 2 

a-2 —*• &i 

a 3 —> a 3 . 


Thus, the intermediate field ^corresponding to Hi is Q(nu; 2 ). Similarly, the 
intermediate field corresponding to H 2 is Q (uw) and the intermediate field 
corresponding to H 3 is Q(u). 

TfeatAewtattcn./ rh.y-A.LC-A. 
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24.2.2 Exercises 

1 . Find the Galois group of C/R. Illustrate the Galois correspondence. 

2. Find the degree of the following field extension F over Q, the smallest 
extension N of F normal over Q, and the Galois group of IV/Q. 

(i) F = Q(V2,V2). 

(ii) f = Q(V 5 +^). 

3. Show that the Galois group of the polynomial ( x 2 — 2) (a ; 2 — 3) over Q is 
isomorphic to Z 2 x Z 2 . 

4. What are the possible degrees over Q of the splitting field of x 3 + ax 2 + 
bx + c e Q[x]? For each such degree, find an f(x) of degree 3 in Q[x] 
whose splitting field has this degree over Q. Can a field normal over Q 
be found in each case? 

5. Find the Galois group G of the polynomial x 3 — x — 1 over Q. Determine 
all subgroups of G and find all corresponding subfields of the splitting 
field. Let 01 , 02,03 denote the roots of x 3 — x — 1. Determine Q (d), where 
d = (o 2 — ai)(o 3 - oi)(a 3 - o 2 ). 

6 . Find the Galois group G of the following polynomials over Q. 

(i) ( x 2 — 3a; + l) 2 (a: 3 — 2). 

(ii) x 4 + x 2 + 1. 

7. Show that the Galois group of the polynomial ( x 2 — 2) (a; 3 — 3) over Q is 
isomorphic to 5 3 x Z 2 . Find all subfields of the splitting field over Q. 

8 . Let F be a splitting field of a polynomial f(x) over a field K. Prove 
that the group G(F/K) is isomorphic to a group of permutations of the 
distinct roots of f(x). 

9. Find the Galois group of f{x) = 0 over the field Q, where f(x) — x 3 —7. 

10. Find all intermediate fields of Q(«,\/7)/Q. 

11. Show that the Galois group of the polynomial equation a: 3 — 2 = 0 is 
isomorphic to that of x 3 — 3 — 0 over Q. 

12. Let F = Q( v / 2, %/5, y/7). Find the order of G(F/ Q). 

13. Let F = Q(\/3, \/TI). Find the subgroups of the group G(Fj Q). Find 
the corresponding intermediate fields. Find all normal extensions of Q 
in F. 

“PuJuc. 7^o^Lc#fta£uiaZ “Ph-ySlc-A. 
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14. Let F be a finite field of characteristic p and [F : Z p ] ~ n. Show that 
F/Z p is a Galois extension and G(F/Z p ) is a cyclic group of order n. 

15. Let F be a finite field of characteristic p. Let [F : Z p ] = n. Show that for 
every positive divisor m of n, F has a unique subfield S of p m elements. 
Also, show that F/S is a Galois extension and G(F/S ) is a cyclic group 
of order —. 

771 

24.3 Roots of Unity and Cyclotomic Polynomials 

In Gauss’s epoch-making work Disquistiones Arithmetica.e , Gauss showed that 
the cyclotomic equation x n — 1 — 0 is solvable for every n in the sense that 
the solutions are expressible in terms of radicals. He not only gave a method 
for finding these expressions, but also determined the values of n for which the 
solutions are expressible in quadratic radicals and in so doing he determined 
the values of n for which it is possible to construct a regular n-gon by means 
of ruler and compass. 

Definition 24.3.1 Let F be any field and, n be a positive integer. Let cv E F. 
Then u is called, an nth root of unity if uF = 1. to is called a primitive nth 
root of unity if cF = 1 and u> m y= 1 for all m, 1 < m < n. 

Let F be a field and n be a positive integer. Let u> E F be an nth root 
of unity. Suppose the characteristic of F is p > 0 and p\n. Then n — p k m, for 
some positive integer k and m, such that gcd(p, m) = 1. Thus, (uj Tn — l) p = 

k 

co p 771 _ l = Lu n — l = 0. Hence, ui Tn — 1 = 0 and so lj is also an mth root of 
unity. 

Theorem 24.3.2 Let K be a field and n be a positive integer. Suppose the 
characteristic of K does not divide n. Let G be the set of all nth roots of unity 
in K. Then G is a cyclic group and \G\ divides n. If x n — 1 splits into linear 
factors in K[x], then |G| = n. 


i-l >71 __ 


Proof. Since 1 E G, G ^ <f>. Let a, b 6 G. Then ( ab~ l ) n = a Tl (6" 1 ) 
Therefore, aft " 1 E G. Hence, G is a subgroup of the multiplicative group 
K* = AT\{0}. Since f(x) = x n — 1 E K[x\ has at most n roots in isf, G is 
finite. Thus, by Theorem 23.1.5, G is cyclic. Let F be the splitting field of 
fix) over K. Since the characteristic of K does not divide n, f'(x) — nr " -1 0. 

Consequently, all roots of f(x) are simple by Theorem 22.1.4. Thus, f(x) has 
n distinct roots in F. Let T be the set of all roots of f(x) in F. Clearly T is 
a group, G C T C F\{0}, G is a subgroup of T, and \T\ — n. Since G is a 

subgroup of T, \G\ divides \T\ = n. S upp ose f{%) splits into linear factors in 

r it^Lc. WafAewtafttra/ ' ' 



24.3. ROOTS OF UNITY AND CYCLOTOMIC POLYNOMIALS 


520 


K[x\. Then F = K and so G = T. Hence, |G| = n. ■ 

Let G, K, and n be as in Theorem 24.3.2. Let G = (uj) with |G| = n. 
Then o(u>) = n. Hence, uj is a primitive nth root of unity. Conversely, if uj 
is a primitive nth root of unity, then uj G G, uj n = 1, and uj 171 1 for all 

m, 1 < m < n. Hence, u is of order n and so G = (uj) . Thus, a; is a primitive 
nth root of unity if and only if G = (uj) if and only if o(G) = n. 

Theorem 24.3.3 Let n be any positive integer and K be a field. 

(i) There exists a finite field extension F/K such tha,t F contains a primitive 
nth root of unity if a,nd only if the characteristic of K does not divide n. (Zero 
is not a divisor of n.) 

(ii) Suppose the characteristic of K does not divide n. Let uj be a primitive 
nth root of unity over K. Then K(uj) is the splitting field of f(x) = x n — 
1 G K[x], f(x) has n distinct roots in K(u >), and the roots of f(x) form a 
multiplicative cyclic group H such that H is gen.era.ted by any primitive nth 
root of unity in K(uj). 

Proof. (i) Suppose the characteristic of K does not divide n. Let f(x) — 
x n — 1 G K [rr] . Then f'(x) = nx n ~ l ^ 0. Hence, all roots of f(x) are simple by 
Theorem 22.1.4. Thus, f(x) has n distinct roots in some splitting field. Let F 
be the splitting field of f(x) over K. Then F/K is a finite extension. Let H be 
the set of all nth roots of unity in F. Then by Theorem 24.3.2, H is a cyclic 
group of order n. Let H = (uj) . Then uj G F and o(u;) = n. Therefore, u is a 
primitive nth root of unity in F. 

Conversely, let uj be a primitive nth root of unity in a finite field extension 
F/K. Then 1, uj, uj 2 , ... , co n ~ 1 G F and these are all n distinct roots of f(x). 
Since deg f(x) = n, f(x) has at most n roots. Thus, all roots of f(x) are 
simple. Hence, f(x) = nr” -1 0. Consequently, the characteristic of K does 

not divide n. 

(ii) By (i), there exists a finite field extension F/K such that F contains 
a primitive nth root of unity, say, uj. Since uj is a primitive nth root of unity, 
1 ,uj,uj 2 ,. .. ,u> n ~ l are all distinct elements and are roots of f{x) = x n — 1 G 
K[x\. Thus, f(x) has n distinct roots in K(uj). Hence, K{uS) is a splitting field 
of f{x). By Theorem 24.3.2, the roots of f{x) form a multiplicative cyclic group 
H of order n. Since the multiplicative order of a primitive nth root of unity is 

n, H is generated by any primitive nth root of unity. ■ 

Definition 24.3.4 Let n be a positive integer and K be a fi,eld whose charac¬ 
teristic does not divide n. Let {uj\,uj 2 , ... ,uj m } be the set of all primitive nth 
roots of unity in the splitting fi.eld F of x n — 1 over K. The polynomial 
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is called the nth cyclotomic polynomial over K and F/K is called the nth 

cyclotomic extension. 

In the following theorem, we describe some important properties of cyclo¬ 
tomic polynomials. 

Theorem 24.3.5 Let n be a positive integer and K be a field such that the 
characteristic of K does not divide n. Let $ n (x) be the nth cyclotomic poly¬ 
nomial over K. Then the following assertions hold. 

(i) x n - 1 = Yl d \n, d >o $d(x)- 

(ii) If P is the prime subfield of K, then 4> n (x) E P[x}. 

(Hi) deg4> n (x) = <f>(n). 

Proof. (i) Let w be a primitive nth root of unity over K. Then K(w) is the 
splitting field of x n — 1 € K[x] and all nth roots of unity form a multiplicative 
cyclic group G of order n. Let d be a positive integer such that d\n. Let Gd — 
{a E G ] o(a) = d). Then {Gd \ d > 0 and d\n} forms a partition of G. Clearly 
for any positive divisor d of n, G contains all dth roots of unity and Gd contains 
all primitive dth roots of unity. Hence, 

x n -l -- ELecO-^) 

— ILln, d >0 FLeG d ( x “ u ) 

— Ildln, d>0 ®d(x). 

(ii) Now 4 * 71 ( 2 :) = Ylu>eG n ( x ~ where G n is as defined in (i). We prove 
the result by induction on n. If n = 1, then 4 * 1 ( 2 :) = x — 1 E P[x]. Suppose the 
result is true for all positive integers k, 1 < k < n. Then for all 1 < d < n, d\n, 
4>^( x) E P[x]. Hence, 


f( x ) = II $ <*(z) G 

d\n, 1 <d<n 


By (i), 

x n — 1 = JJ 4* d (x) = f(x)$ n (x) E K[x\. 

d\n, d>0 

Now x n — 1 E P[x\ and f(x) is monic. By the division algorithm, there exist 
q(x),r(x) E P[x] C K[x] such that 

x n - 1 = q(x)f(x) + r(x), 

where either r{x) — 0 or degr(j:) < deg f(x). Hence, by the uniqueness of 

quotients and remainder in K\x 1, r(x) and ^.(x) = q(x) E P\x\. 

L J W P/^l ±lc^L 
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(iii) 

number of distinct primitive nth roots of unity 
number of distinct elements of G of order n 
number of generators of G 

■ 

We now examine cyclotomic polynomials over Q. Suppose that u> is a com¬ 
plex root of unity. Then for some positive integer n, \u\ n = |u> n | = 1. Hence, 
|cu| — 1 and so u> lies on the unit circle in the complex plane. Also, u must 
be of the form e~ l = cos ^ + i sin for some integer k, 0 < k < n. Thus, 
there are exactly n nth roots of unity. These roots of unity divide the unit 
circle into n equal arcs, from which we get the word “cyclotomy.” 

Theorem 24.3.6 Let u £ C be a primitive nth root of unity over Q. Let 4> n (:r) 
be the nth cyclotomic polynomial over Q. Then the followinq assertions hold. 

(i) 4>n(x) e Z[x], 

(ii) 3> n (:r) i- s irreducible over Q. 

(iii) [Q(w) : Q] = (f>{n). 

(iv) G( QH/Q) —U n . 

Proof. (i) We prove the result by induction on n. If n = 1, then $i(x) = 
x — 1 £ Z[x\. Suppose the result is true for all positive integers K i < k < n. 
Then for all 1 < d < n, d\n, $d{x) £ Z[x]. Hence, 

f(x) = $ d (x) € Z[x\. 

d\n , l<d<n 


deg4> n (x) = 


By Theorem 24.3.5(i), 

x n -l= [I W = /W^WeQW. 

d\n , d>0 

Now x n — 1 £ Z[x\ and f(x) is monic. By the division algorithm, there exist 
q(x),r(x) £ Z[x\ C Q[x] such that 


x n — l — q(x)f(x) + r{x 


where either r{x) = 0 or degr(x) < deg fix). Hence, by the uniqueness of 
quotients and remainder in Q[x], r(x) = 0 and $ n (x) = q(x) £ Z\x\. 

(ii) By Lemma 16.2.8, it is sufficient to show that 4> n (:r) is irreducible over 
Z. Suppose f{x) £ Z[x] is an irreducible factor of $ n (i). Let 4> n (a;) = f{x)h{x) 
for some h(x) £ Z[x\. Since 4> n (x) is monic, both f{x) and h(x) can be taken 
to be monic. Let to be a root of fix). Then to is also a root of $ n (x) and 

hence cu is a primitive nth root of unito^ Let pT>e a prime such # that pjipes 

ritte WarAewtafLca 
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not divide n. Then gcd(p,n) = 1 . Hence, lo p is also a generator of G , where G 
is the multiplicative cyclic group of all nth roots of unity. Thus, u> p is also a 
primitive nth root of unity. We now claim that u) p is also a root of f(x). 

Suppose u) p is not a root of f{x). Since to p is a root of $ n (x), cu p is a root 
of hix). Therefore, a; is a root, of h{x p ). Since f(x) is irreducible over Z and 
hence over Q and a; is a root of f(x), f(x)\h(x p ) by Corollary 21.1.9. Hence, 
h(x p ) = f(x)g(x) for some g(x) G Q[x]. Since f(x), h(x p ) G Z[x], we can 
conclude that g{x) G Z[x\ by using the division algorithm (as in (i)). For 
t(x) G Z[x], let t(x) be the corresponding polynomial in Z p [x], i.e., if a G Z is 
a coefficient of t(x), then [a] G Z p is a corresponding coefficient of t(x). Since 
the characteristic of Z p is p, h(x p ) = {h,{x)) p . Thus, 

( h{x)) p = h{x p ) = f{x) g{x). 

Hence, f{x) and h{x) have a common irreducible factor. Now 

$n(z) = f(x) h(x) 

and <3? n (x)|(:r n — 1), Therefore, x n — [1] G Z p [x] has a multiple root. Let a be 
a multiple root of t(x) = x n — [1]. Then t'(a ) — na n_1 = 0. Since p does not 
divide n, [n]a n_1 = 0 implies that a n ~ l = [0] and so a = [0]. But [0] is not 
a root of x n — [1], which gives the desired contradiction. Thus, u> p is also a 
root of fix). By induction, we can show that u p is also a root for any positive 
integer r. By induction, we can also show that u p i Ps is also a root of f{x), 
where the pfs are distinct primes such that pi does not divide n and the t % are 
positive integers. From this, it follows that for all k, 1 < k < n, gcd(/c,n) = 1, 
<jj k is a root of f(x). Since 

{uj k | 1 < k < n, gcd (k,n) = 1 } 

is the set of all primitive nth roots of unity, every primitive nth root of unity 
is a root of f(x). Hence, $n{ x ) — f( x ) an d so & n {x) is irreducible over Z. 

(iii) Clearly Q(cj)/Q is a finite normal separable extension. Thus, by Corol¬ 
lary 24.2.12 and Theorem 24.3.5, 


|G(Q(«)/Q)| = [QM:Q] = tf(n). 


(iv) Now for any a G G(Q(cu)/Q), o~(u>) is a primitive nth root of unity. 
Hence, cr(co) = u) d for some d, 1 < d < n, and gcd(d, n) = 1. Also, a is 
determined if cr(cj) is determined. We denote this a by < 7 ^. It can be easily 
verified that if c, d are integers such that 1 < c,d < n, gcd(c, n) — 1 , and 
gcd(d, n) = 1, then a c d = cr c o a Define 


T : U 


n 


^ifLajLi±cjftx. uLlcjclL “PhyS-LcS. 
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by T([d]) — a d . Then 'L is one-one function from U n onto C?(Q(a>)/Q). Let 
[c], [d] € U n . Then cd = qn + r for some integers q and r, 0 < r < n. Then 
[cd\ = [r] and a cd (u}) = = u qn+r = cj r = a r (cu). Therefore, a c d = a T . Thus, 

^([c][d]) = T([cd]) = T([r]) = a T = a cd = a c oa d = T([c]) oT([d]). Hence, T is 
a homomorphism. Consequently, 


G(QH/Q) ~ I/„. 


Corollary 24.3.7 Let n be a positive integer. Then for every positive divisor 
m of n , 

x 71 - 1 . 

— e Z W- 


a: 


m 


Proof. By Theorem 24.3.5, 




"-1= n *«(*), 

d\n, d >0 


where $ d ( x ) is the dth cyclotomic polynomial over Q. Let m be a positive 
divisor of n. Then 


Z n -1 = Ud\n,d>oM^) 

ri(i|n, d>m ‘ n,s]m, s>0 ^s(^) 

^ (x m -l)Ud\n,d>mM x )- 


Hence, 


X II 

d 1 7~i y d ^ t 1 1 


X 


m 


By Theorem 24.3.6, $ d { x ) G Z[x\ for every positive integer d. Thus, 

x n - 1 


x 


m 


n ®d(x)€ Z[x\. 

d\n, d>m 


Corollary 24.3.8 Let n be a positive integer. Then for every proper posi¬ 
tive divisor m of n, <L n { x ) divides where & n (x) is the nth cyclotomic 

polynomial over Q. 


Proof. As in the proof of Corollary 24.3.7, 


*”rT= n **(*)■ 

d|n, d>m 


X 


m 


Hence, 


X = $n{ x ) n ^d( x )‘ 


r»77i 
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Thus, $ n (z) divides fS-y- ■ 

We now remove the assumption of commutativity in Theorem 10.1.23 as 
promised. 

Theorem 24.3.9 (Wedderburn) A nontrivial finite ring D without zero di¬ 
visors is a field. 


Proof. We have already seen in Corollary 10.1.24 that a finite integral 
domain is a field. Hence, it suffices to prove that D is a commutative ring. 
Since D is finite, D has prime characteristic p and contains Z p . Set F = {a | 
a £ D, ad = da for all d € D}. Now 0,1 £ F so that F / 0. Let a, b £ F. Then 
(a — b)d = ad — bd = da — db = d{a — b) for all d 6 F. Thus, a — b <E F. For 
6/0, ( ab~ l )d = a(b~ 1 d) = a(db _1 ) = d(ab~ 1 ) for all d € F since from bd = db , 
we can obtain db -1 = b~ l d by multiplying on the left and right by 6 _1 . Hence, 
a6 -1 (E F so that since F is clearly commutative, F is a field in D. Now D is 
a vector space over F of finite dimension, say, n. Let q denote the number of 
elements in F. Then D has q n elements and the multiplicative group G of D 
has q n — 1 elements. 

Suppose n > 1. We shall obtain a contradiction. For any g € G, g F, 
we set D g = {d \ d £ D,dg — gd}. Then as above D g is a division ring and 
clearly D g D F. Since D is also a vector space over D g , we have that D g 
contains q d elements for some positive integer d, which must divide n. Thus, 
the multiplicative group G g of D g has order q d — 1 . Now G g is the normalizer 
of g in G and hence the number of conjugates of g in G is the index °f 
G g in G. Decomposing G into conjugacy classes, we thus obtain 

«"-i = (*-D+£fTp 

where the sum is taken over a finite set of proper divisors d{ of n. Let $ n (^) 
be the nth cyclotomic polynomial over Q. By Corollary 24.3.8, 4?n( x ) divides 
. Also, by Corollary 24.3.7, G Z[x}. Thus, $„(<?) is an integer divid- 

1, x — J_ gc x 1 

Tl 1 

ing q n — 1 and all the 1 and so also dividing q — 1. But $n(?) = n(<7 — u j) 
and so we obtain 

l$n(g)| = Y[\{q-Uj)\ >q- 1 

since \q — u>j\ > q — 1 > 1 for all j and since q > 2. But this is contrary to the 
statement that 4> n (g) divides q — 1 . Hence, n = 1 and so D = F. ■ 


24.3.1 Worked-Out Exercises 


0 Exercise 1 Let w be a primitive eighth root of unity over Q. Describe 

(x). 
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Solution: By Theorem 24.3.5, 


x 8 — 1 = $i(x)$ 2 (x)$ 4 (x)$s(x). 


Thus, 


$ 80 ) = 


0 8 - 1 ) 


$l0)$2 0)3*40) 

Now 4>i0) = x — 1, $2 0) = £ + 1 and $40) = x 2 + 1. Hence, 

$ 8 0 ) =-~ ^-= x 4 + 1 

} o-i)0 + i)0 2 + i) ' 

0 Exercise 2 Let n be a positive integer and w be a primitive nth root of 
unity over Q. Show that 

[Q( W + I):Q] = M. 

OJ z 


Solution: By Theorem 24.3.6, [QOO : Q] = 4>{ n )- Now Q C Q(u> + C 
QO). Therefore, 

[QM : Q] = [QM : Q(« + -)][Q(w + -) : Q] (24.9) 

LO LO 

By Corollary 24.2.12, C?(QO)/QO 4- ^)) = [QO) : QO + Now 

G(QM/Q(u+-)) C G(Q(w)/Q). 

OJ 

Let cr 6 G( QH/Q). Since crO) is a primitive nth root of unity, <j{u) = 
to d , where 1 < d < n and gcd(d, n) = 1. If d = 1, then cr is the identity 
automorphism. Suppose d ^ 1 . Also, suppose cr £ G(QO)/QO + ^)). Then 
crO + -) = lo + —. Hence, co d + O = a (to + — )■■ = lo + —. From this, it follows 

that u d — lo = ^ — Jj, i.e., Lo d — co = ^ Jd~ l • Thus, u;(u ; d-1 — 1) = • Since 

Lo d ~ l — 1^0, <o d+1 = 1. Hence, n = d + 1 since o(lo) = n. Thus, d = n — 1. 
Therefore, the only elements of G(QO)/Q) which fix each element of QO + ~;) 
are the identity automorphism and the automorphism cr given by <t(lo) = u> n ~ 1 . 
Consequently, 

[QM:Q(u + i)]= G(QH/Q(o.+ i)) =2. 

LO LO 

Now [Q( to) : Q] = 0(n). Hence, from Eq. (24.9), it now follows that 


[Q(w+i) : Q] = M 

OJ A _ _ 

“Puluc- TfeoiAewta^Lca/ 
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<0 Exercise 3 Let K be a field of characteristic 0 and n be a positive integer. 

Let w be a primitive nth root of unity in some field extension of K. Show 

that 

(i) K[u)/K is a normal extension and 

(ii) G(K(uj)/K) is commutative. 

Solution: (i) By Theorem 24.3.3, K (u>) is the splitting field of x n — 1 E 
K[x}. Hence, by Theorem 24.1.3, K{uS)/K is a normal extension. 

(ii) Since the characteristic of K is 0, K{co)/K is separable. Since K(u>)/K 
is also a normal extension, it follows that K(u)jK is a Galois extension. Let 
a, j3 E G(K(u>)/K). Now a(u) and (3{u ) are roots of x n — 1. Thus, <a(oj) = a / 
and (3{uj) = uj j for some i and j. Clearly (a o (3){lo ) = ((3 o a)(a;). From this, it 
follows that a o j3 = j3 o a. Therefore, G(K{uS) / K) is commutative. 


24.3-2 Exercises 

1 . Find the Galois group of fix) = x 2 — x + 1 over Q. 

2. Show that the Galois groups of x 4 — 1 and x 2 — x +1 over Q are isomorphic. 

3. Let p be a prime and ® p {x) be the pth cyclotomic polynomial over Q. 
Show that 

4>p(x) = 1 + x 4-b x p ~ l . 

4. Let n be a positive prime. Show that $2 n (x) = Q n (—x), where 4> n (x) is 
the nth cyclotomic polynomial over Q. 


5. 


Let n be a positive integer. Let p be a prime such that p does not divide 
n. Show that 



$n(z) ’ 


where Qpnix) and Q n {x) are the pnth and nth cyclotomic polynomials 
over Q, respectively. 


6 . Find a polynomial irreducible over GF( 3) having a primitive eighth root 
of unity as one of its roots in GF{ 9). 


7. Let K be a field of characteristic 0. Let 0 7 ^ a E K and f{x) = x n — a, 
where n is a positive integer. Let F/K be a field extension such that 
f{x) splits over F. Show that F contains a primitive nth root of unity. 


8 . Let m and n be relatively prime positive integers. 

(i) Show that the splitting field of x 17171 — 1 over Q is the same as the 
splitting field of ( x m — l){x n — 1) over Q. 

(ii) From (i), deduce that (p(mn) = <p(rn)<f{n). * # 

P*u*uc. r4y.Ai.cJ. 
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9. Let m and n be relatively prime positive integers. Let u m and cu n be the 
primitive rath and nth roots of unity, respectively. Show that Q(ca m ) n 
Q(w n ) = Q. 

10. Let K be a field with characteristic not dividing n and F be the splitting 
field of x n — 1 over K. Prove that F contains exactly 0(n) primitive nth 
roots of unity, where <fc is the Euler ^-function. 

11. Let n be a positive integer, if be a field containing all nth roots of unity, 
and 0 ^ a E K. Let F be the splitting field of f(x) = x n — a E K[x\ and 
b be a root of f(x). 

(i) Show that F = K{b). 

(ii) Show that the Galois group G(FjK) is commutative. 


24.4 Solvability of Polynomials by Radicals 

The reader is familiar with the quadratic formula, which says that the roots of 
the polynomial x 2 + bx + c are 


—6 ± V b 2 - 4c 

2 


The only restriction is that the field of which b and c are elements is not of 
characteristic 2. 

By choosing cube roots correctly, the roots of the cubic polynomial x 3 + 
bx 2 + cx + d are 


s + t — g, 
ws + w 2 t - 
w 2 s + wt - 


b 

V 

3 ’ 


where w ^ 1 is a cube root of 1, 


s = 




si 

4 ’ 


t 

P 

q 


3 / — g 
2 


p 3 

27 


+ ^ 
^ 4 ’ 


C — 

2 b 3 
27 


P 

3 ’ 

-$ + *■ 


The field containing 6, c, d is not of characteristic 2 or 3. 

In a similar manner, there exists a formula for the roots of a quartic poly¬ 
nomial. This formula is also given in terms of combinations of radicals of ra¬ 
tional functions of the coefficients. Abel showed that no such general formula 
be given for the roots of fifth degree or higher degree polynomials. This 


can 
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does not mean that no such formula exists for certain polynomials of degree 
5 or larger. Evariste Galois determined exactly for which polynomials such a 
formula exists. Galois’s theory, polished by Emil Artin, is considered to be one 
of the most profound and beautiful works in the history of mathematics. 

Consider the cubic polynomial x 3 + bx 2 + cx + d over a field K of charac¬ 
teristic not equal to 2 or 3 and consider the chain of fields 


K C K(u) C K{u, +u)C K(u , 



)CF, 


where u = y §7 + an< ^ F = K(u, 2 + u, yj— | — u), w). Then F contains 
the roots of the polynomial x 3 +bx 2 +cx+d. Also, ±u are roots of the polynomial 
X 2 — U 2 , yj ~2 + u is a rOOt of X 3 — ( — | + u) , yj ~2 ~ u I s a root of X 3 — ( — | — u) , 

and w is a root of x 3 — 1 . That is, F contains the splitting field of x 3 +bx 2 +cx-\-d 
over K and F is obtained by successive adjunction of roots of a polynomial of 
the form x n — a. In this sense, we mean that x 3 + bx 2 + cx + d is solvable by 
radicals. 


Definition 24.4.1 A finite field extension F/K is called an extension by 
■ & 

radicals (or radical extension) if there exists a finite chain of fields 

K = K 0 C Ki C • • • C K m = F (24.10) 

such that K{ = Ki-\(r {), where ri is a root of x Ui — a {, € AT^-i, /or som.e 
positive integer n{ (i = 1,2, ... ,m). T/?.e polynomial f(x) € AT [a:] (or the equa¬ 
tion f(x) — 0 ) is called solvable by radicals if its splitting fi,eld is contained 
in an extension by radicals of K. 

A chain of fields like that in (24.10) is called a root tower. 

A question immediately comes to mind. If a polynomial is solvable by 
radicals, is its splitting field automatically a radical extension? The answer 
to this question is “no.” Let f(x) — x 3 — 4x + 2. Since deg/(x) = 3, f(x) is 
solvable by radicals over Q. Now /(0) > 0 and /(1) < 0. Hence, the graph of 
f(x) must cross the x-axis three times. Thus, fix) has three real roots. Hence, 
a splitting field F of f{x) over Q lies in R. We will not show it here, but F is 
not a radical extension since [F : Q] is not a power of 2. The interested reader 
may find the details worked out in Isaacs. 

The following is immediate from the definition of an extension by radicals. 

Lemma 24.4.2 Let K C L C F be a chain of fields such that LjK and F/L 
are radical extensions. Then F/K is a radical extension. ■ 

Theorem 24.4.3 Let K be a field of characteristic 0 and F/K be an extension 

by radicals. Let K = Kq C K\ C ■ • • C AT , = F_be the chain of intermediate 

~7VfM* Ph.y.±l^A 
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fields such that K z = K l -\{r l ), where r\ is a root of x ni — a 2 , a z E Ki-i, for 
som.e positive integer n t ( i = 1,2, Then there exists a finite chain of 

fields 

K = F 0 c F x c • • • c F m = E 

such, that Fi/K is a norm, a, l radical extension, x ni — ai splits over Fi , and, 
Ki C F 2 for all i, 1 < i < m. 


Proof. Let Fq = F. Suppose we have constructed a chain of fields 


K = F 0 CF 1 C...CF i 

such that Fj/K is a normal radical extension, x n i — aj splits over Fj, Kj C Fj 
for all j , 1 < j < i. Let G — G(Fi/K) = {e = cri,<J 2 ,... , cr s }. Now r i+ i is a 
root of x ni+1 — ai + i E Ki[x) C Ffix]. Hence, — a* + i E Fi. Consider the 

polynomial 

g{x) = Or 71 '* 1 - < 7 i(a^ + i))(a : ni + 1 - cr 2 (a*+i)) ■ • • ( x Ui+l - cr s (a I+ i)) € Ffix}. 
Now -(cr 1 (oi + i)+ ••• +cr 5 (a i+ i)), 

(<Ji(a i+ i))(o- 2 (ai+i)) + (<7i(a i+ 1 ))(<7 3 (a i+1 )) 4-+ {a 3 -i K + 1 ))(cr s (a i+1 )),..., 

(—l) A: ((Ji(ai + i))(cr 2 (ai + i)) • • • (cr s (ai + i)) are the coefficients of g(x), each of 
which is fixed under oi,... ,cr s . Since K is the fixed field of GiFijK ), g(x) € 
K[x\. Let Fi+i be a splitting field of g{x) over Fi. Then by Worked-Out Ex¬ 
ercise 3 (page 502), Fi+\/K is a normal extension. Consider the polynomial 
x ni+1 — <Ti(a,i + i) E Fi[x]. Let ci, c 2 ,..., c nf+1 be the roots of x ni+1 — a\{ai + i). 
Then c ™ 7+1 E F%, 1 < j < n^ + Thus, we have a chain of fields 

Fi C jPj(ci) C Fj(ci,c 2 ) C-.-C Fi(ci,c 2 ,.. • ,Cn i+1 ) = F^. 

Clearly Fn is a radical extension of F*. Similarly, we can obtain a radical 
extension F l2 /Fn by adjoining the roots of the polynomial 

(x ni+1 -CT 2 {a i+ i)) E Fi[x\ C Fii[®]. 

Continuing like this, we obtain a chain of fields 


Fi C Fn C Fi 2 C • • ■ C Fi S = Fi+i, 

such that Fi t+i is a radical extension of Fa obtained by adjoining roots of the 
polynomial ( x 71i + 1 — at+i(a l+ 1 )). By Lemma 24.4.2, F^+i is a radical extension 
of Fi and hence of K. Since r^ + i is a root of g(x ), r^+i E F* + j, and hence 
Ki + 1 = Ki(ri + 1) C F l (ri + i) C F*+i . Therefore, we have a chain of fields 


K = F 0 CF l 




24.4. SOLVABILITY OF POLYNOMIALS BY RADICALS 


531 


such that Fj/K is a normal radical extension, x nj — a 3 splits over Fj, Kj C Fj 
for all j, 1 < j < i + 1- Proceeding as above we obtain a finite chain of fields 

K = F 0 C Fi C • ■ • C Fm = E 

such that F Z JK is a normal radical extension, x Ul — a* splits over F{, and 
Ki C F % for alH, 1 < i < m„ ■ 

The following corollary is immediate from Theorem 24.4.3 

Corollary 24.4.4 Let K be afi,eld, of characteristic 0. LetF/K be an extension 
by radicals. Let K = Kq C K\ C ■ • • C Km — F be the chain of intermediate 
fields such that Ki — Ki-iirf), where ri is a root of x ni — a^, G i for 
som.e positive integer rii (i = 1,2 ,,m). Then there exists a root tower 

K = F 0 CF 1 C...CF TTl = E 

such that K C F C E and EjK is a norm,al extension. M 

Theorem 24.4.5 Let K be a field of characteristic 0. Let F/K be a normal 
radical extension with root tower 


K = K 0 C Ki C • • • C K m = F 

such that Ki = K z -\(ri), where ri is a root of x ni — ai , G Ki-\ for some 
ni (i = 1,2, ..., m). Let n = n\U 2 ■ • • n m . Suppose K contains all nth roots of 
unity. Then G(F/K ) is a solvable group. 

Proof. Now for alii, 1 < i < m, Ki contains all n^+ith roots of unity. Let 
1 = u>i, u>2, ■ • •, UJ n i+1 be the distinct n^+ith roots of unity. Then r l+ \ = r l+ 
ri + iu> 2 , .. ., r^+io >n i+ i are the distinct n 2 +i roots of x Ui+1 — a*+i G Ki[x\ and 
clearly all these roots are in K z+ \ = Ki(r t+ \). Hence, K l+ 1 is the splitting field 
in F of x ni+1 — ai + i over Ki. Thus, G{Ki + \/Kf) is a commutative group by 
Exercise 11 (page 528). Let Gi = G{F/Ki). Then each Gi is a subgroup of Gq 
and we have the chain of subgroups 

Go DGi D DGm = {e}. 

By the fundamental theorem of Galois theory, 

G(K i+1 /Ki) ~ G(F/Ki)/G(F/K i+1 ) = G l /G t+l . 

Thus, GijGi+i is a commutative group. Hence, Go = G(F/K) is solvable. ■ 

Theorem 24.4.6 Let K be a field of characteristic 0 and fix) be a polynomial 
in K[x\. If f{x) is solvable by radicals, then the Galois group of f(x) over K 
is solvable. 
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Proof. Let E be the splitting field of f{x) over K. Let F/K be a radical 
extension with root tower 


K = K 0 C Ki C ■ • • C K m = F 

such that Ki = iG-i(^), where r* is a root of x ni — a*, a* € for some 

positive integer n* (f = 1, 2,..., m) and E C F. By Corollary 24.4.4, we may 
assume that F/K is a normal extension. Let n = n\n 2 ■ • • n m . 

Suppose K contains all nth roots of unity. Then G(F/K ) is solvable by 
Theorem 24.4.5. Clearly E/K is a normal separable extension. Hence, by the 
fundamental theorem of Galois theory, G(F/E ) is a normal subgroup and 

G(E/K) ~ G(F/K)/G(F/E). 

Thus, G{E/K ) is a homomorphic image of a solvable group. Hence, G(E/K ) 
is solvable. 

Now suppose K does not contain all nth roots of unity. Let u be a primitive 
nth root of unity over K. Let K' = K(oj). Then K' / K is a normal extension 
and K' contains all nth roots of unity and G(K'/K) is commutative by Worked- 
Out Exercise 3 (page 527). Thus, G(K'/K) is solvable. Suppose uj ^ F. Let 
F' = F(uj). Then F'/F is a normal extension and F' is a splitting field of 
x n — 1 E K[x\ over F. Hence, by Worked-Out Exercise 3 (page 502), F'/K is 
a normal extension. Clearly 


K = K 0 C K\ C • • • C Km = F C F f 

is a root tower and so F'/K is a radical extension. Also, E C F C F'. 
Therefore, we may assume that u> E F. Now F/K' is a normal extension since 
F/K is a normal extension. Also, 

K' = K' 0 CK[C...CK / m = F 

is a root tower such that K[ — K' i _ l (ri) : where is a root of x ni — a^, G 
Ki-i C K [_i for some n^ [i = 1,2, ..., m). Consequently, by Theorem 24.4.5, 
G(F/K') is solvable. By the fundamental theorem of Galois theory, 

G(K'/K) ~ G(F/K)/G(F/K'). 

Hence, G(F/K) is solvable. As in the previous case, G(E/K) is solvable. ■ 


To obtain the result of Abel that the general polynomial of degree n > 5 
is not solvable by radicals, it suffices to find a polynomial of degree n whose 
Galois group is S n because S n is not solvable for n > 5. 

We proceed to find such a polynomial. Consider the polynomial ring F[x i, 
..., x n \ and its field of quotients Fix i, x n ). Lat S n be the symmetric 


... x n ). Let o n be the symmetric gmup 

MafAem.af c.aZ 



24.4. SOLVABILITY OF POLYNOMIALS BY RADICALS 


533 


acting on {1, 2, ..., n}. We can consider S n as a group of permutations acting 
on F(x i,..., x n ) in the following manner: For a E S n and a rational function 
f(xi ,... , x n ) E F{x i,..., x n ), define the mapping 

f{x i,...,z n ) -> /(x Q(1) ,...,x Q(n) ). (24.11) 

We will call this mapping a. By Exercise 3, a is an automorphism of 
F(xi,... ,x n ). The fixed field of Fix i,..., x n ) with respect to S n is the field 
K , where 

K = {f(xi,...,x n ) I f(xi,...,x n ) E F(xi,...,x n ), 

f (*^1 5 • • • ) x n) (■T'c^l) 1 • ■ • j •T'a(n)) foi" a H OL E <Sn} • 

The elements of K are called the symmetric rational functions. Set 


ai 

- IH-1- = ELl x i 


02 




'L)i<j<k X i X j X k 

(24.12) 


X1 X2 ' ‘ ■ x n 



These functions are known as the elementary symmetric functions and 
they are symmetric functions. Note that for n = 2, x\ and a?2 are roots of the 
polynomial i 2 — ait+a2; for n = 3 , x\, £2, and £3 are roots of C —a\t 2 -\-a2t — az] 
and when n — 4 , x\, X2, £3, and x 4 are roots of t 4 — ait 3 + a2t 2 — azt 4 - 04. 
Since ai, ..., a n E K, F(ai> ..., a n ) C K. 

Theorem 24.4.7 Using the above notation, we h,a.ve 

(i) [F(x !,... ,£ n ) : K\ = n\, 

(ii) K = F(a 1 ,... ,a n ), 

(Hi) S n = G{F[x 1 ,.. .,x n )/K). 


Proof. Since S n is a group of automorphisms of F(x 1 ,... , £ n ) leaving K 
fixed, S n C G(F{x\,... ,x n )/K). Thus, by Theorem 24.2.6, 

[F{xi,... ,x n ) : K] > |G(F(xi,.. .,x n )/K)\ > |5„| = n! 

The polynomial p(t) = t 7l -ait 7l ~ 1 +a 2 t n ~ 2 H-b(-l) n a n over F(ai,..., a n ) 

has roots xi, ..., x n and factors over Fix 1 , ..., x n ) into it — x\) • ■ ■ it — x n ). 
Thus, it follows that F[x 1 ,... , £ n ) is the splitting field of p{t) over F(ai, ..., 
a n ). Since p(t) is of degree n, 

[F(xi, ...,x n ) : F(ai ,... ,a„)] < n\ 


Thus, since 


F(ai ,..., a n ) C iFC F{x \,..., x n L 

ritte rPLajLlt.C-Wt.ajt tea/ r4y.Ai.cJ. 
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[F(x i,..., x n ) : F(a\ ,..., a n )] is both greater than or equal to n! and less than 
or equal to n! Hence, we have 

[F(x i : F(gx ,..., a„)] = n\ 

and K = F(a\, ..., a n ), proving (i) and (ii). By Theorem 24.2.13, 

\G(F(x 1 ,...,x n )/K)\=n! 

and since S n Q G(F(x i,..., x n )/K), we have S n = G(F(x i,..., x n )/K ), prov¬ 
ing (iii). ■ 

We have now established our goal. The Galois group of the polynomial p(t) 
over F(a \,..., a n ) is S n and S n is not solvable for n > 5. Hence, p(t) is not 
solvable by radicals for n > 5. 

Theorem 24.4.8 Let G be a subgroup of S p , where p is a prime. If G contains 
a p-cycle and a transposition, then G = S p . 


Proof. If p = 2, then |5 P | — 2 and the result is immediate. Suppose p — 3. 
Let (a b ) and (x y z) (E G , where {a, 6} C {x,y,z}. Then it is easy to show 
that 

G = {e, ( a b ), (a: y z), (x z y), (a b) o (x y z), (a b) o (x z y)}. 

Suppose p — 5. Let a = (a b) and (3 = (x y z u v), where {a, 6} C {a:, y , z, u , v}. 
Then there exists a positive integer n such that j3 n = (abode), where 
{a,b, c, d, e} = {x,y,z,u,v}. It is easily verified that 

P n o a o P~ n — (b c ) 

P 2n o a o j3~ 2n = (c d) 

P 3 n oaoP~ 3n = (de). 

Hence, (a b), (b c), (c d), (d e) e G. Thus, 

(6 c) o (a b) o (b c ) = (a c) G G 

(c d) o (a c) o (c d) — (a d) G G 

(d e) o (a d) o (d e) — (a e) G G 

(c d) o(b c) o (c d) = (b d) e G 
(d e) o (6 d) o (d e) = (6 e) £ (7 

(d e) o (c d) o (d e) = (c e) G ( 7 . 


Hence, G contains the above 10 transpositions. However, these are all the 
transpositions of S$ since ( 2 ) = 10. Since every permutation is a product of 
disjoint cycles and every cycle is a product of transpositions, G contains all the 
permutations of {x,y,z,u,v}. Hence, G — £ 5 . (We ask the reader to consider 


the theorem for arbitrary p.) M 


“PuJuc. 
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Theorem 24.4.9 Let f(x) be an irreducible polynomial in Q[x]. Suppose that 
deg/(a?) = p , where p is a prime. If f(x) has exactly p — 2 real roots and two 
complex roots, then the Galois group of fix) over Q is S p . 

Proof. Let S be a splitting field of f(x) over Q such that Q C S C C. Let 
G denote the Galois group of S/Q. Now p|[.S : Q] and [S : Q] = |G|. We see by 
viewing G as a group of permutations on the roots {rq, rq, ..., r p } that G must 
contain an element of order p , which is necessarily a p-cycle. Let r\ — a + bi 
and 7*2 = a — bi. Then the automorphism a of C, which maps every complex 
number to its conjugate must map S onto S since a is the identity on R and 
o;(ri) = 7 - 2 , afa) = r\. Hence, we see that a 2 is the identity and so is a trans¬ 
position. By the previous theorem, G = S p . ■ 

Although Galois and Abel are most noted for their work involving the 
existence of formulas for finding the roots of polynomials, their approach to 
solving mathematical problems along with that of British algebraists marks the 
birth of modern algebra. Their work resulted in abstract and widely inclusive 
theories. Actually, Lagrange’s work on algebraic equations and especially on 
analytic mechanics anticipated the awakening of the strength of the abstract 
and general approach. It was Hilbert’s work on the foundations of geometry 
(1899) which finalized the abstract approach. 

24.4.1 Worked-Out Exercises 

<0 Exercise 1 Show that the Galois group of the polynomial f(x) = 2x 5 — 
10;r + 5 over Q is S 5 . Conclude that the equation f(x) = 0 is not solvable 
by radicals. 

Solution: We have that f(x) is irreducible over Q by Eisenstein’s criterion. 
Now 

/'( X) = 10 (x 4 - 1 ). 

Hence, f'{x ) has two real roots, namely, 1 and — 1 . Since /(— 1 ) > 0 and 
/( 1 ) < 0 , it follows that f(x ) has three real roots, say, ri,r 2 ,r *3 such that 
r\ < — 1 < r -2 < 1 < r 3 . The other two roots of f(x) are complex numbers. 
Thus, from Theorem 24.4.9, the Galois group of f(x) is S 5 . Hence, by Theorem 
24.4.6, the equation f[x) — 0 is not solvable by radicals. 

24.4.2 Exercises 

1 . Find the roots of the polynomial 2x 3 + 9x + 6 by using the formula for 
the root of a cubic. 


Pit^Le. 7^o^Lc#fta£uiaZ 
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2. In F[x 1 , £ 2 , • • • , X n ], x l, x 2 , • • • , X TI are roots of 

p(x ) — x n — a\x n ~ l + a, 2 X n ~ 2 +-b ( —l) n a ri , 

where the ads are defined on page 533. Demonstrate this result for n = 2 
and n — 3. 

3. Prove that a:, given on page 533, is an automorphism of F{x\, X 2 , ..., x n ) 
and that a fixes F(ai, 02 ,, a n ). 

4. It can be shown that a symmetric polynomial is a polynomial in the 
elementary symmetric functions in x\, X 2 ,..., x n . Express the following 
as polynomials in the elementary symmetric functions in xi,X 2 , X 3 . 

(i) x\ + X 2 + x§, 

(ii) (xi - x 2 ) 2 (xi - x 3 ) 2 (x 2 - x 3 ) 2 . 

5. Show that for every finite group G, there is a field K and a polynomial 
f{x) € K[x\ such that the Galois group of /(x) over K is isomorphic to 
G. 

6 . Find the Galois group of the polynomial x 3 — 3x + 1 over Q. Solve the 
equation x 3 — 3x + 1 = 0 by radicals. 

7. Show that the Galois group of the polynomial /(x) — x 5 — 10x 4 + 2x 3 — 
24x 2 + 2 over Q is S 5 . Is the equation /(x) =0 solvable by radicals? 


“Public. 
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Joseph Henry MacLagan Wed- 
derburn (1882-1948) was born on Febru¬ 
ary 26, 1882, in Forfar, Scotland, the tenth 
of 14 children. His father was a physician. 

In 1898, he matriculated at the University 
of Edinburgh. In 1903, he received an M.A. 
degree with first-class honors in mathemat¬ 
ics. The following year he went to Leipzig 
and Berlin because of the influence on him 
of Frobenius and Schur’s work. He received 
his doctorate of science in 1908. 

In 1909, Wedderburn was appointed pro¬ 
fessor at Princeton University. During World 
War I, he fought for the British army, returning to Princeton after the war and re¬ 
maining there until 1945. Besides being editor of the Annals of Mathematics from 
1912 to 1928, Wedderburn published 38 papers, and in 1934, published a textbook, 
Lectures on Matrices. 

Wedderburn is most noted for the two famous theorems which bear his name. 
He proved both theorems between 1905 and 1908. The structure of algebras over 
real and complex fields had been determined by Cartan and others. Wedderburn was 
interested in determining the structure of algebras over arbitrary fields. He showed 
that a semisimple algebra is a direct sum of simple algebras. Later, in his paper “On 
hypercomplex numbers,” he proved that every simple algebra is a matrix algebra over 
a division algebra. In the second theorem, he proved that every finite division ring is a 
field. His theorem on finite algebras gave a structure of all projective geometries with 
a finite number of points. Wedderburn died on October 9, 1948, in New Jersey. 






Chapter 25 

Geometric Constructions 


25.1 Geometric Constructions 

In this chapter, we consider some problems from geometry. We are concerned 
with constructions in the Euclidean plane that can be made by straightedge 
(unmarked ruler) and compass only. We identify the Euclidean plane with 
R x R. We assume that we are given some length, which we take as our unit 
length, and two points O and X which we label (0,0) and (1,0), respectively. 

Using straightedge and compass, we can do the following in the Euclidean 
plane: 

(i) Draw a line through two given points. 

(ii) Draw a line parallel to a given line and passing through a given point. 

(iii) Draw a line perpendicular to a given line and passing through a given 
point. 

(iv) Draw a circle with a given center and passing through another given 
point. 

We draw a line through O and X and call it the x-axis. Now we draw a line 
perpendicular to the x-axis and passing through O and call this the y-axis. 
Thus, we are able to coordinatize the plane. Hence, we have the x-axis , y-axis, 
origin O = (0, 0)., and the point X = (1,0). 

Given line segments of lengths a and b, using straightedge and compass, we 
can construct line segments of lengths a + 6, a — b, ab, and ab~ l (for b ^ 0). 
Since we have a unit length, using straightedge and compass, we can draw 
a line segment of any integer length in a finite number of steps. Thus, using 
straightedge and compass, we can draw a line segment of any rational length in 
a finite number of steps. We leave these facts as an exercise for the interested 
reader. 

For the construction of a line segment of length ab~ l , we first draw two 
lines through a point P. From P, mark off a point Q on one line of length b 
and then mark off a point U on the same line and same direction of length a 
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from Q. On the other line, mark off a point S from P of length 1. Construct a 
line UV parallel to QS with V on line PS. An argument using similar triangles 
shows SV is of length ab -1 . 



Definition 25.1.1 Let P CR x R. Let l be a line and C be a, circle in R x R. 

(i) (a, b ) is a point in P if (a, 6) € P. 

(ii) l is a line in P if l passes through two distinct points in P. 

(in) C is a circle in P if the center of C is in P and C passes through 
another point in P. 

If C is a circle in P C R x R, then the radius of C is the distance between 
two points in P, where the distance is the Euclidean distance. 

Definition 25.1.2 Let F be any subfield of R. The set 

Pf = {{x,y) | x,y G F} 


is called the plane of F. 


Let p\ — (x\,y\), P2 = (x2,y2) £ Pf- Let l be the line passing through p\ 
and p 2 ■ If x\ = X 2 , then l has the form x = x\. If x\ ^ Z2, then l has the 
equation 


y-y l 


V2 - y i 

X 2 - X\ 


(x - Zi) 


which can be reduced to the form ax + by + c — 0 for some a, 5, c 6 F. Hence, 
a line in Pp is of the form 

ax + by -f c = 0 


for some a, 6, c 6 F. Such a line is said to be a line in F. 

Let C be a circle with center at p\ and passing through p 2 . Then the 
equation of C is 

(x - zi) 2 + (y - yi) 2 - r 2 , 


where r G F is the radius of C. This equation of C can be put in the form 
x 2 T y 2 + ax + by + c ~ 0 for some a, 6, c G F. Hence, a circle in Pp has an 
equation of the form 

x 2 + y 2 + ax ±by + c = 0 

riOix*. maf4a##ia£lea/ Phy. 
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for some a, 6 , c E F. Such a circle is said to be a circle in F. 

Let Pi F be the set of all points of Pp and the points obtained by 

(i) intersecting two lines in Pp, 

(ii) intersecting two circles in Pp, and 

(iii) intersecting a line and a circle in Pp. 

It is easy to verify that two lines in Pp intersect in a point in Pp. 

Let C\ and C 2 be two circles in Pp with equations 

x 2 + y 2 + o.\x + biy + ci = 0 (25.1) 

and 

x 2 + V 2 4- a 2 x + b 2 y + c 2 = 0 (25.2) 

for some a 2 , &i, b 2 , ci, c 2 E F. Subtract Eq. (25.2) from Eq. (25.1) to 
obtain 

(ai - a 2 )x + (bi - b 2 )y + (ci - c 2 ) = 0. (25.3) 

Thus, the points of intersection of Ci and C 2 are the same as the points of 
intersection of either Eq. (25.1) or Eq. (25.2) with Eq. (25.3). Hence, case (ii) 
reduces to case (iii). 

Let l be a line and C be a circle in Pp with equations 

aix 4- biy 4- ci = 0 (25.4) 

and 

x 2 + y 2 + a 2 x + b 2 y + c 2 = 0 (25.5) 

for some ai, a 2 , bi, 62 , Ci, c 2 E F. Eliminate y from Eqs. (25.4) and (25.5) to 
obtain an equation of the form 

ax 2 + bx 4 - c = 0 

for some a, b, c E F. Using our knowledge of the quadratic formula or the fact 
that ax 2 + bx + c — 0 is solvable by radicals, we have that the polynomial 
ax 2 + bx + c has roots in F(y/r) for some positive r € F. (r can be taken to be 
positive since we have assumed the line and the circle intersect.) 

Inductively, we can construct a sequence 

Pf = Po F C Pip c P 2 p c ... c P iF c • •., 

where Pi F is the set of all points of Pi~i F and the points obtained by 

(i) intersecting two lines in 

(ii) intersecting two circles in Pi~i F , and 

(iii) intersecting a line and a circle in Pi-i F . 

Lemma 25.1.3 Let F be a subfield of R. Let a G R. The following conditions 
are equivalent. 

(i) (a,0) G P np for some n > 0 . 

(ii) (a, a) G P mF f or som.e m > 0. 

(iii) (0,a) G Pt F for som,e t > 0. 



25.1. GEOMETRIC CONSTRUCTIONS 


541 


Proof. (i)=>(ii) Let l be the line x — y and C be the circle (x — a ) 2 Ty 2 — a 2 . 
Clearly l is a line in P nF . The center of C is (a, 0) G P np and C passes through 
another point (0,0) G P nF . Hence, C is a circle in P nF . Now l and C intersect 
at (a, a). Hence, (a, a) G P n+ i F . Let m = n + 1. Then (a, a) G P mF . 

(ii)=4>(i) Let l be the line y = —x and C be the circle x 2 + y 2 = 2a 2 . 
Then l is a line in P^ and C is a circle in P mF . Now l and C intersect at 
(a, —a) G P m + i F - Let l' be the line x = a and l" be the line y = 0. Then V and 
l" are lines in P m+ \ F . Now l' and l" intersect at (a, 0) G Pm+2 F ■ Let n — m + 2. 
Then (a, 0) G P UF - 

Similarly, (ii)^(iii). ■ 

Theorem 25.1.4 Let F be a subfield of R. Let a, b G R and (a, 0), ( 6 ,0) G P np 
for some n > 0. Then the following assertions hold. 

(i) (a, 6 ) G P mF for some m > 0. 

(ii) (a ± 6 , 0) G P mF for some m > 0. 

(Hi) (a&, 0) G P mF for some m, > 0. 

(iv) Let 6/0. Then (|, 0) G P mF for some m, > 0. 

(v) Let a > 0. Then (-/a, 0) G Pm F for some m > 0. 

Proof. (i) By Lemma 25.1.3, (a, 0), (a, a), (0, 6 ), ( 6 , 6 ) G Pt F for some t > 0. 
Thus x — a and y = b are lines in p tF . These lines intersect at (o, 6 ) and so 

(a, 6 ) G Pt+i F - Let m = t + 1. Then m > 0 and (a, 6 ) G P mF - 

(ii) Let l be the line y = 0 and C be the circle (z — a) 2 + = 6 2 . Then 

l is a line in P np C P n+lF . The center of (7 is (o, 0) G P njF ^ -Pn+i F and C 
passes through (a, 6) G P n +i F - Therefore, C is a circle in P n +i F . Now l and C 
intersect at (a ± 6,0) G P n +2 F - Let m = n + 2. Then (a ± 6, 0) ^ Pm F • 

(iii) By (i), (ii), and Lemma 25.1.3, (a, 6 — 1), (0,6) G Pk F for some k > 0. 

Then ay = — x Tab is a. line in Pk F since it passes through (0, 6 ) and (a, 6—1). 
Also, y = 0 is a line in P^ F . Both these lines intersect at (< 26 ,0) G P^ + i F . Let 
m — k + 1. Then (a 6 ,0) G P mF . 

(iv) If a — 0, then the result is trivially true. Let a / 0. Now by (i), 
(ii), (iii), and Lemma 25.1.3, (0,a), (a,a(l — 6 )) G Pk F for some k > 0. Then 
bx = a — y and y = 0 are lines in Pk F . These lines intersect at (f, 0) G P*;+i F . 
Let m — k T 1 > 0. Then (f, 0) G P mF . 

(v) Let l be the line y = 1 and C be the circle 

+ (»■- — ) 2 = (—) 2 - 

The center of C is (0, T) e ft, for some k > 0 . Also, C passes through 
( 0 , 0 ) 6 P kF . Thus, l is a line and C is a circle in Pfc F . Now l and C intersect 
at (y/a, 1 ) G Pk+i F ■ Let C' be the circle 
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The center of C' is (>/o, 1) £ -P/c+i F and it passes through (2i/a, 2) for 

some t > k + 1. Hence, C' is a circle in Pt F - Also, y = 0 is a line in Pt F . Now 
C' and y = 0 intersect at (2 y/a, 0) ^ Pt+i F - Thus, (y/a , 0) £ P mF for some 

Definition 25.1.5 Let F be a subfi.eld of R. A real number r is said to be 
constructible from F if (r, 0) £ P np for som,e n > 0. 

The following theorem is immediate from Theorem 25.1.4. 

Theorem 25.1.6 Let F be a subfield of R. Let S be the set of all constructible 
real numbers from. F. Then S is a subfield of R and F C S. Moreover, if a £ S 
and a > 0, then yfa £ S. 

Lemma 25.1.7 Let F be a subfield of R. Let a and b be real numbers. Suppose 
there exist real numbers a\, a 2 ,..., a n and b\, & 2 , • • ■, such, that 

(i) a\ £ F, 

(ii) a) £ F{a\, a 2 ,..., Qi-i), 2 < i < n, and a £ F(aj, a 2 ,..., a n ), 
fmJ 6 2 £ F, 

(iv) b 2 £ 2 <j<m, and b £ F(6 2 , 6 2 ,..., fe m ). 

Then there exist real numbers s\, s 2 , • • ■, s*: such that 

(v) si £ F, 

(vi) si £ F(si,S 2 , ... ,5<-i), 2 < i < k, and a,b £ F(s 2 , 52 , - ■ •. «*)• 

Proof. Let F\ = F and Fj = F(ai, 02 ,..., a 2 -i), 2 < i < n + 1. Let 

k = n + m, Si = a*, 1 < i < n and s n+ j = bj, 1 < j < m. Then 

(a) s 2 = a? £ F, proving (v), 

(b) si = a\ £ F(ai, a 2j ...,a;_i) = F(si,s 2 , • • •, 2 < i < n, 

(c) 4+i — £ F C F(si, s 2 , - ■ •, s n ), 

(d) 4+j = ^ £ F( 6 i, 62 , ---, 6 j-i) C F„+i(fei, b 2i ..., 6 j_i) = F(ai, a 2 ,..., 
a n, &i, & 2 , • • • > fy-i) = F(si, s 2 ,..., s n+; _i), 2 <j < m. Also, a £ F(ai,a 2 ,..., 
an) C F(si,s 2 ,... ,s fc ) and b £ F( 6 2 ,& 2) • • •, b m ) C F(s 2 , s 2 , • • •, Sfc), proving 
(vi). ■ 

Lemma 25.1.8 Let F be a sub field of R. Let r £ R be such that r 2 £ F Then 

(i) for all a £ F(r), (a, 0), (0, a) £ F 2f and 

(ii) for all a,b £ F(r), (a, 6 ) £ P 3p , he., P F (r) £ F 3f . 

Proof. (i) If r £ F, then the result is trivially true. Suppose r ^ F. Let 

a £ F(r). Then a = 6 + cr for some 6 , c £ F. Let / be the line 

TfeaiAewtalLca/ 
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and C be the circle 


(x - 6) 2 + (y- 


9 9 

c r 


1,0 ,c 2 r 2 + 1 


■r = (- 


) 2 . 


2 2 i 

Then l is a line in F. The center of C is ( 6, — L j--) E Pf and it passes through 
another point (6, c 2 r 2 ) E Pf- Hence, C is a circle in Pp. Now l and C intersect 
at a point ( b cr, 0) = (a,0). Thus, by definition, (a, 0) E P\ F . By a similar 
argument, we can show that (0, a) E P\ F . 

(ii) Let a, 6 E Fir). Then by (i), (a,0), (0,6) E P\ F . As in the proof of 
(i) =>(ii) in Lemma 25.1.3, (a, a), (6, 6) E P2 F - Hence, x = a and y = b are lines in 
P2 F which intersect at the point (a, 6). Therefore, by definition, (a, 6) E Pz F . ■ 


Lemma 25.1.9 Let F be a subfield of R. Let x,y E R. Suppose there exist 
real numbers si, s 2 ,..., Sfc such that 

(i) s\ E F, 

(ii) s 2 E F(si,s 2 , ..., Si_i), 2 <i <k. 

Let T = F(si, s 2 , • • •, Sfc). Then Pp C P nF /or some n > 1. 


Proof. If A: = 1, then the result holds by Lemma 25.1.8. Suppose the 
result is true for all i, 1 < i < k. If s^ E F(si, s 2 ,..., then the result 

holds by the induction hypothesis. Suppose s^ F(si,s 2 ,..., Let K = 

F(si, s 2 ,..., Sfc-i). By the induction hypothesis, Pk Q Pm F for some m > 1. 
Clearly, P^ C P m +i F and P 3 K C P m+ 3 F . Let a, 6 E T = K{s^). Then by 
Lemma 25.1.8, (a, 6) E P 3 F C P m+ 3 F . Let n ~ m + 3 > 1. Then Pp C P„ F . ■ 


Theorem 25.1.10 Let F be a subfield of R. Let x,y E R. Then (x,y) E 
P nF for som,e n > 1 if and only if there exists a sequence of real numbers 
s \i s 2i • ■ -1 such, that, 

(i) si € F, 

(ii) s( € F(s\,S2, . .. 2 <i <k, and x,y 6 F(s lt S 2 ,. .., s fc ). 


Proof. Let (x,y) E P nF for some n > 1. We prove the result by induction 
on n. Let n = 1 and (x, ?/) If x, y E F, then the result is trivial. Suppose 

x,y £ F. Then (x,y) is obtained by either intersecting a line and a circle or 
two circles in F. Then, as shown before, x, y E F{y/r) for some r E F. Let 
= r 2 . Then sj E F and x,y E F(si). Hence, the result is true for n = 1. 
Suppose the result is true for all Pk F such that 1 < k < n. Let ( x,y ) E P nF . 
If (x,j/) E P n _i F , then the result holds by induction. Suppose (x,y) £ P n _i F . 
Then (x,?/) is obtained by intersecting two lines or two circles or a line and a 
circle in P n _\ F . Suppose (x, y) is obtained by intersecting a line L and a circle C 
in P n _i F . Then L passes through two distinct points (a, 6), (c,d) E P n -i F and 
C has its center (u.v) E P n -n and the radius r of C is the distance between 
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two points in P n -i F . Now by induction hypothesis and by Lemma 25.1.7, it 
follows that there exists a sequence of real numbers Si, S 2 ,..., s*.-i such that 

(i) 5? G F, 

(ii) s? G F(si, 52, • • •, 5f_i), 2 <i < k- 1, and a, b,c,d,u,v, r G T(si, 52, ..., 

l)- 

Thus, (x, y) is obtained by intersecting a line and a circle in -F(si, S 2 , ..., 
Hence, there exists a real number Sk such that s| G F(si, S 2 , -. -, Sfc_i) 
and x, y G F(si, 52,, 5fc-i)(sfc) — F(si, 52 ,..., s*,). The other cases are sim¬ 
ilar. 

The converse follows by Lemma 25.1.9. ■ 

The following theorem is immediate from Theorem 25.1.10. 

Theorem 25.1.11 Let F be a subfield o/R. A real number r is constructive 
from. F if and only if there exist real numbers r\, r 2 ,..., r n such that 

(i) r\ G F, 

(ii) r} G i ? (ri,r 2 ,... ,ri_i), 2 < i < n, and r G F(ri,r 2 ,..., r n ). ■ 

Definition 25.1.12 A real number a is constructible if it is constructible 
from Q. 

Definition 25.1.13 (i) A point (a,b) is constructible (or located) in the 
Euclidean plane if a and b are constructible real numbers. 

(U) A line segment is constructible in the Euclidean plane if its end 
points are constructible. 

(Hi) A line is constructible in the Euclidean plane if it passes through 
two distinct constructible points. 

(iv) A circle is constructible in the Euclidean plane if its center is con¬ 
structible and it passes through another constructible point. 

Theorem 25.1.14 Let S be the set of all constructible numbers in R. Then S 
is a sub field of R a,nd. Q C S. Moreover if a G S and a > 0, then ^/a G S. 

Proof. The proof follows from Theorem 25.1.6. ■ 

Theorem 25.1.15 The real number r is constructible if and only if there exists 
a finite number of real numbers s\,... } s n such that 

(i) sj G Q, 

(ii) s~ G Q(si,..., i) for i = 2,..., n such that r G Q(si,..., s n ). ■ 

Corollary 25.1.16 If the real number r is constructible, then r lies in som.e 
extension of Q of degree a power of 2. 
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Proof. If r is constructible, then there exist real numbers ,..., s n satis¬ 
fying conditions (i) and (ii) of Theorem 25.1.15 such that r G Q(si, • ■ • ,s n ). 

Now [Q(si,...,s n ) : Q] = [Q(si,..., s n ) : Q(si,..., s„_i)] [Q(si,. -., s n -i) : 
Q(si,...,s„_ 2 )] *--[Q(si) : Q], which is clearly a power of 2. ■ 

Corollary 25.1.17 If the real number r is a root of an irreducible polynomial 
over Q of degree k, where k is not a power of 2, then r is not constructible. ■ 

Theorem 25.1.18 A real number a is constructible if and only if using straight¬ 
edge and compass we can construct a line segment of length |a| in the Euclidea,n 
plane. 

Proof. Suppose using straightedge and compass that we can construct a line 
segment PQ of length |a| in the Euclidean plane. We may assume that a > 0. 

Let P = {x\,y{) and Q = (x 2 ^ yf)- Then P and Q are constructible points in the 
Euclidean plane and hence x\, y\, X 2 , y 2 are constructible real numbers. Hence, 
by Theorem 25.1.15 and Lemma 25.1.7, there exist real numbers si,..., s n such 
that 

(i) s? e Q, 

(ii) s? 6 Q(si,..., Si- 1 ) for i = 2,... ,n such that xi, y\, X2, y2 £ Q( s i> ■ • •, 
s n ). Now a 2 = (xi - x 2 ) 2 + (yi - j/ 2) 2 G Q(si,..., s n ). Let s n+ i = y/a. Then 
^n+l 6 Q(si,-.. , s n ) and a G Q(si,... , s n +i). Hence, a is constructible from 

Q 

Conversely, suppose a is constructible from Q. Then A = (a,0) G P nQ for 
some n > 0, where P nQ is defined as above (here the arbitrary field F — Q). 

Let B = (0,0) G Puq ■ Then A and B are two constructible points in the Eu¬ 
clidean plane. Hence, we can construct the line segment AB in the Euclidean 
plane in a finite number of steps, and AB is of length |o| . ■ 

We have now laid enough groundwork to answer by algebraic methods some 
ancient questions of geometry. 

Theorem 25.1.19 R is impossible to trisect an angle of 60° by means of 
straightedge and compass alone. 

Proof. Suppose that an angle of 60° can be trisected by straightedge and 
compass. Then the real number r = cos 20° is constructible. From the trigono¬ 
metric formula cos 36 — 4 cos 3 9 — 3 cos 6 and by setting 9 =■ 20°, we obtain 
i — 4 r 3 _ 3 r or g r 3 _ g r _ 1 — 0 . Thus, r is a root of the polynomial 
8x 3 — 6 x — 1 = 0 over Q. The possible linear factors of 8x 3 — 6x — 1 over 
Z are (x ± 1), (fix ± 1), (4x ± 1), and (8a: ± 1). However, it is easily verified 
that ±1, ± 7 ;, ±-|, are not roots of 8 x 3 — 6 x — 1 . Therefore, 8 z 3 — 6 a: — 1 

is irreducible over Z and thus over Q,^Thus, bx Corollary 25.1.J.7, r isjaot # 
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constructible. Consequently, it is impossible to trisect an angle of 60°. ■ 

There are some angles which can be trisected by means of straightedge and 
compass alone; for example, angles of 90° and 72°. We ask the reader to verify 
this fact. 

Another ancient problem is that of “squaring the circle,” that is, construct¬ 
ing a square whose area is equal to that of a given circle. Since the area of a 
circle is 7 T r 2 , where r is the radius of the circle, this problem is equivalent to 
the constructibility of 1/7F. However, it can be shown that 7 r, whence -y/ 7 r, is not 
even algebraic over Q, let alone a root of a quadratic polynomial. Hence, it is 
impossible to square the circle. Thus, we have the following result. 

Theorem 25.1.20 It is impossible to square the circle by straightedge and 
compass alone. ■ 

We now consider the problem of “duplicating the cube,” that is, construct¬ 
ing a cube whose volume is twice that of a given cube. If the original cube is 
the unit cube, then the problem reduces to the construction of a real number 
r such that r 3 = 2. Since the polynomial x 3 — 2 is irreducible over Q, we have 
by Corollary 25.1.17 that it is impossible to duplicate a cube. 

Theorem 25.1.21 It is impossible to duplicate the cube by straightedge and 
compass alone. ■ 

Example 25.1.22 Consider a triangle of sides of length 1, l,r, where the side 
of length r is opposite a,n angle of 36°. Then the other two angles are 72° 
each. Draw a bisector from, one of the 72° angles to the opposite side. Similar 
triangles are obtained. The ratios of the corresponding sides yield = K 
Thus, r 2 + r — 1 = 0. Hence, 

-l + \/5 

r = -. 

2 

Thus, r is constructible and so an angle of 36° is constructible. 

Theorem 25.1.23 Let 9 6 R. Then the following conditions are equivalent. 

(i) The angle 9 is constructible. 

(ii) The number cos 9 is constructible. 

(Hi) The number sin 9 is constructible. 


Proof. (i)=>(ii): There exist constructible points p and q such that the 
radian measure of the angle p(0,0)g is 9. Without loss of generality, we may 
assume that q lies on the x-axis. The unit circle then intersects the line contain¬ 
ing q and p at the point r — (cos 9 , sin 9). Thus, cos 9 and sin 9 are constructible 

since r is constructible. _ m _ _ 
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(ii) =^(iii): Since cos# is constructible, the point q = (cos0,0) is con¬ 
structive. We may construct a line containing q and perpendicular to the 
x-axis. This line intersects the unit circle at the point (cos #, sin #). Hence, 
sin # is constructible since cos 6 is constructible. 

(iii) =^(i): Since sin# is constructible, cos# is constructible by an argument 
similar to that of (ii) implies (iii). The line through (0,sin#) parallel to the 
x-axis intersects the unit circle at the point p — (cos#,sin#). Therefore, p and 
so q — (cos#,0) are constructible. Consequently, the angle # is constructible 
since the angle p(0 ,0)g has radian measure #. ■ 

25.1.1 Worked- Out Exercises 

<C> Exercise 1 Let n G N. Let 6 n denote an angle with radian measure —. 

Show that a regular polygon with n sides is constructible if and only if 
the angle # n is constructible. 

Solution: The desired result follows by noting that we may inscribe a 
regular polygon in the unit circle. 

0 Exercise 2 Let #,0 G R and m,n G Z. If # and (f> are constructible, show 
that the angle with radian measure m# + ncf) is constructible. 

Solution: The numbers cos #, sin #, cos 0, and sin (j> are constructible. Now 
sin(m# + n(f)) is equal to an algebraic expression involving cos #, sin #, cos 0, 
and sin </>. Since the set of constructible numbers is a field, sin(m# 4- n(fc) is 
constructible and so the desired result follows by Theorem 25.1.23. 

0 Exercise 3 Let m,n G N. Let # n denote an angle with radian measure 

(i) Show that if # mn is constructible, then # m and 6 n are constructible. 

(ii) Show that if # m and # n are constructible, where m, and n are relatively 
prime, then # mn is constructible. 

Solution: (i) We note that 

2tt2iV 

mG mn = m —- = — = 9 n 
mn n 

and similarly n6 mn — # m . Hence, the result follows from Worked-Out Exercise 
2 (page 547). 

(ii) Since m and n are relatively prime, there exist integers s and t such 
that 1 = sm + tn. Thus, 

27T 27T777,S 4- 27 itn 

9mn = ~ — s9 n + tum- 

mn mn 

Hence, # mn is constructible by Worked-Xlut Exeixise 2 (page 547). 

rufuc- matAcMiatuia/ r4y.Ai.cJ. 



25.1. GEOMETRIC CONSTRUCTIONS 


548 


Exercise 4 Show that the regular 9-gon is not constructible. 

Solution: Suppose that a regular 9-gon is constructible. Then an angle 
of 40° (= 360°/9) could be constructed. However, an angle of 20° could then 
be constructed by bisecting the 40° angle. But this is impossible by Theorem 
25.1.19 since it is shown there that it is impossible to construct an angle of 20°. 

Exercise 5 Show that it is possible to construct an angle of 30°. 

Solution: Since \/3 is constructible, y/3/2 is constructible. Thus, cos 30° 
is constructible and so 30° is constructible. 

Exercise 6 Show that the regular 20-gon is constructible. 

Solution: By Example 25.1.22, an angle of 36° can be constructed. Hence, 
an angle of 18° can be constructed by bisecting the angle of 36°. Since ^ = 18, 
the regular 20-gon is constructible. 

25.1.2 Exercises 

1. Given line segments of length a and 6, show that it is possible by straight¬ 
edge and compass to construct line segments of length a ± 6, ab. 

2. Prove that it is impossible to construct a cube whose volume equals that 
of a given sphere. 

3. Prove that an angle of 40° cannot be constructed. 

4. Prove that it is impossible to construct a regular septagon by straightedge 
and compass. 

5. Prove that the regular pentagon and hexagon are constructible. 

6. Prove that it is possible to trisect angles of 90° and 72°. 

7. Prove that it is impossible to construct a cube whose volume is three 
times the volume of a given cube. 

8. Let n £ N, n > 1. Let n = p ^ • • ■pp' be the prime factorization of n. 
Prove that a regular polygon with n sides is constructible if and only if 
a regular polygon with p e - 1 sides is constructible, i — 1,2, ..., r. 





Chapter 26 


Coding Theory 


26.1 Binary Codes 

In this section, we examine techniques for transmitting information across a 
noisy channel. The information is often represented as a sequence of binary 
digits (0’s and l’s). The channel may be space, as in satellite communication 
systems, or wires or cables, as in the telephone system, or wires as in circuits in 
a digital computer. Erratic currents called noise are always present to interfere 
with transmitted signals. Erratic currents can also be caused by such things 
as sunspots or magnetic storms. The channel noise will occasionally cause a 
transmitted one to be mistakenly interpreted as a zero or a transmitted zero 
to be mistakenly interpreted as a one. In order to reduce the effects of such 
errors, the transmitter may adjoin to the sequence of m (binary) message digits, 
s check digits. 

The s check digits are selected by a method that makes them dependent 
on the m message digits. This is accomplished by mapping the sequence of 
message digits onto a sequence of n = m + s digits called the codeword. This 
function is called the encoding scheme. The codeword is then transmitted. 
The receiver or decoder maps the received word, which may be different from 
the codeword due to channel noise, onto a sequence of m digits. This function is 
called the decoding scheme. Claude E. Shannon is credited as the originator 
of general coding theory. 

The main aim of this section is to discuss the concepts of error detection 
and error correction. 

Throughout this chapter, we let 0 and 1 denote the elements of the field 
Z 2 . For n > 1, let 

B n -Z 2 x Z 2 x • • ■ x Z 2 . 

'-s,-" 

Ti times 

Definition 26.1.1 A binary (rn,n)-code is a A-tuple (B m , B n , E, D), where 
B m is the set of all binary m-tuples, B n is the set of all binary n-tuples (n > 
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m) and E : B m —> B n and D : R B m , where R C B n . The functions E and 
D are called the encoding scheme and the decoding scheme, respectively. 

A nonempty subset of B 171 is called a set of message words. Let X C B™ 
be a set of message words. Then E(X ) is called a set of codewords. These 
codewords are transmitted across a noisy channel. Let X' be the set of received 
words after transmission. These received words are decoded by the decoding 
function D. Then D{X ') is the set of decoded words. We show by the following 
diagram the above coding and decoding process. 



Block diagram of a general data communication or storage system. 


We use the notation c = (c \,..., c n ) £ B n for a codeword and r = (ri,..., r n ) 
£ B n for a received word. 


Example 26.1.2 ((m, m + l)-Parity-Check Code) This code is an error 
detecting code. The encoding function E is defined by 

E(ai,a 2 ,. . • i O'Tn ) = (ai, 0-2 ,.. ., a m , a m+i) 1 

where a Tn+ \ — (ai + a 2 + • • ■ + a m )( mod 2). Then a m +i is 0 or 1, depending 
on whether the number of 1 ’s in ai , a 2 ,..., a m is even or odd. 

For example, let us consider the (3,4) -parity-check code. Then B 3 is the 
set of m.essage words and C = {(0, 0, 0, 0), (0, 0, 1, 1), (0, 1, 0, 1), (0, 1, 1, 
0), (1, 0, 0, 1), (1, 0, 1, 0), (1, 1, 0, 0), (1, 1, 1, 1)} is the set of codewords. 
Any odd, number of errors can be detected, but the code fails to detect an even 
number of errors. 

Example 26.1.3 ((m, 3m) -Repetition Code) In this code, E : B m B 3m 
is defined as 

E(ai,a 2 

j • • • j ) = (ai, a 2 ,..., a m ) > ^2 j > &2 > • ■ • j ^m) ■ 


Let x, y, z £ B m . Then xyz denotes the word w £ B 3m such that the first m, 
letters of w are those of x, the next m letters of w are those of y, and the last 
m letters of w are those of z. Define the decoding function D : £? 3m —>■ B 771 as 
follows: The ith digit of D(w), w £ B 3m , is the m,ember that appears as the ith 
digit, in at least two of the words, x, y, z, where x, y, z £ B m and w = xyz. 

For example, if m, = 3 and a = (1, 0, 1) € B m , then E(a) = (1, 0, 1, 1, 0, 
1, 1, 0, 1). Now w — aaa is a codeword. Suppose that the transmission m,akes 


an error in the sixth digit. Then the received word, say, v, is (1, 0, 1, 1, 0, 0, 

~Pubixz. 
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1, 0, 1). Let x — (1, 0,1), y = (1, 0, 0), z = (1, 0, 1). Now the first digit of 
D{y) is 1 since 1 is the first digit of x, y, a,nd z. The second digit of D(v) is 0 
since 0 is the second digit of x, y , and z. The third digit of D{y) is 1 since 1 is 
the third digit of x and z. Hence, D{v ) = (1, 0, 1). 

We find that this code can detect a single error and can also correct the 
error. It follows that this code can also detect two errors, but it can correct 
only one error. 


Example 26.1.4 (Repetition Code) Let m be an integer and d an even 
integer. Let s = dm. Then n — {d + 1 )m. Define E and D as follows: For 
all a — (<ii,..., a m ) 6 B m , E(a) - c, where for j — 0,1,..., d, Cj m +i — 
i — 1,..., m. That is, a is encoded by breaking it into m-character blocks, each 
of which is transmitted (d+ 1 )m times. For all f £ B n , 


D(r) = 


0 if more than half the rj m+ i are zero, j = 0,1,..., d, 
1 if m,ore than half the r JTn +i are one, j = 0,1,..., d. 


If more than half the digits in a fixed position of a codeword c of Example 
26.1.4 are altered by channel noise, then the decoder will commit a decoding 
error. 

If s were allowed to be odd in Example 26.1.4, then it would be possible 
for the number of zero making up r to be equal to the number of nonzero 
tv In such a case, the decoder may decide not to decode the received word r. 
This is an example of a decoding failure. 

If a decoding algorithm decodes all received words, the algorithm is called 
complete; otherwise it is called incomplete. A decoding algorithm is com¬ 
plete if and only if R — B n in Definition 26.1.1. 

Assume now that errors in transmitting successive digits occur indepen¬ 
dently. Thus, if p is the probability that a given digit will be received correctly, 
then the random variable counting the number of errors in a received word has 
a binomial distribution. That is, the probability p^ of exactly k errors in an 
n-digit received word is pk = {f^)p n ~ k q k , k = 0,1,..., n, where q — 1 — p. This 
simplified mathematical model for the channel is called the binary symmetric 
channel. 

Encoding by Matrix Multiplication 

We now describe a technique for encoding binary words by matrix multiplica¬ 
tion. 

Definition 26.1.5 An m x n matrix (m < n) with entries from Z 2 is called 
a generator matrix if the subm,atrix consisting of the first m columns of this 
m.atrix has rank m. 
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An m x n generator matrix M can be written as a partitioned matrix 
M = ( M" M'), where M" is an m x m invertible submatrix of M and M' is 
the submatrix of M consisting of the last n — m columns of M. 

Let M be an m x n [m, < n) generator matrix. Define the corresponding 
coding function 

rp . D m n»l 

JhM ■ & •—> n) 

by Em (a) — aM for all a £ B m . Here we regard an element a of B m as a 1 x m 
matrix over Z 2 , aM = cG B n , where c — (ci,..., c n ), M = (e^) and 

n 

Cj = ^ a-ieij (mod 2), j — 1, 2,... ,n. (26.1) 

i=l 

Let M" = 7 m , the m x m identity matrix. From Eq. (26.1), we find that 
Cj — aj for j = 1,2,..., m and the submatrix M' of M consisting of the last 
n — m, columns of m determines the check digits to be adjoined to the message 
word a — (ai,..., a m ). 


Example 26.1.6 Let 


M = 


1 0 0 0 1 1 
0 10 10 1 
0 0 1110 


be a generator matrix over Z 2 , This defines an encoding function Em ’■ B 3, 
B G . For example, let a = (1 0 1) € B 3 . Then 


E m (a) -(10 1 ) 


1 0 0 0 1 1 
0 10 10 1 
0 0 1110 


= (1 0 1 1 0 1) 6 B 6 . 


Hence, 101 is a, message word and the corresponding codeword is 101101. 1, 0, 1 
are the check digits which are adjoined to the message word 101. 


Definition 26.1.7 If M = ( M" M f ) is an m, x n generator m,atrix (m < n), 
then the n x (n — m) m.atrix 


\ J 

where J n _ m is the (n — m) x (n — m) identity m.atrix, is called the corresponding 

parity check matrix of M. 

We ask the reader to verify that c is a codeword if and only if cK = 0, 
where K is the parity check matrix. 
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Example 26.1.8 The parity check matrix for the generator m.a.trix of Example 
26 . 1.6 is 

"Oil* 

1 0 1 


0 1 0 
0 0 1 


We now turn our attention to an important class of codes discovered by 
Richard W. Hamming. 


Definition 26.1.9 An (m,n) code (B m , B n , E , D ) is called a Hamming 
(m, n) code if E is defined by a generator m.atrix M such that the rows of the 
corresponding parity check matrix K contain all the 2 £ — 1 nonzero vectors of 
B l , where t = n — m. 


Example 26.1.10 Let 


1 0 0 0 1 1 1 
0 10 0 110 
0 0 10 10 1 
0 0 0 1 0 1 1 


Then M is a 4x7 m.atrix. The corresponding parity check m,atrix is 


K = 


1 1 1 
1 1 0 
1 0 1 
0 1 1 
1 0 0 
0 1 0 
0 0 1 


K contains all the nonzero elements of B 3 . Hence, the encoding function defined, 
by the generator matrix M defines a, ( 7 , 4 ) Hamming code. 


The (m, n)-Hamming code corrects every single error pattern. No other 
errors and no other (2 t — 1 — t , 2* — 1) code can be constructed, which will 
correct more than all single errors. Any received word with two or more errors 
will be decoded as if it had one error. In this case, a decoding error is made. 
This follows since if the received word f has a single error in the ith component, 
then rK is just the fth row of K. Thus, every received word with a single error 
can be corrected because K has n rows, which are nonzero and distinct. 
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Weight and Distance 

Definition 26.1.11 Let a E B n . The weight of a, denoted by wt(a ), is the 
number of 1 ’s in a. 

Example 26.1.12 For 01100 E B 5 , utf(01100) = 2. 

Definition 26.1.13 Let a,& E B n . The distance between a and b, denoted 
by d(a,b ), is defined by 

d{a, b ) = wt(a + b ). 

Example 26.1.14 Fora — 0101 andb= 1110, a + 6 = 1011. Then wt(a + b ) = 
3. Hence, d(a,b) = 3. 

We leave the proofs of the next two results for the exercises. 

Theorem 26.1.15 Let a, 6, c E B n . Then 

(i) d{a , b) = the number of locations i with ai ^ b{. 

(ii) d(a, 0) —wt(a). 

(Hi) d(a , b) = 0 if and only if a = b. 

(iv) d(a, b) = d(b,a). 

(v) d(a, b) + d(b , c) >d(a,c).M 

Theorem 26.1.16 (i) A code ( B m , B n , E , D) can detect all sets ofk or fewer 
errors if and only if the minim,um. distance between any two distinct codewords 
is at least k + 1. 

(ii) For a code to correct all sets of k or fewer errors, it is necessary that 
the minimum, distance between any two distinct codewords be at least 2/c + 1. ■ 

Example 26.1.17 Consider the following set C of codewords in B 6 . 

C = {000000, 001110, 010101, 100011, 011011, 101101, 110110, 111000}. 

The minimum distance between two distinct codewords is 3. Hence, this code 
can detect two or fewer errors. 

Group Codes 

Now ( B m , +) and ( B n , +) are commutative groups, where for both, + is defined 
by componentwise addition (mod 2) of vectors. Clearly \B n \ = 2 n . Let C be the 
subset of B n consisting of all codewords. That is, C = {c | c E B n , cK = 0}. 
Clearly C is a subgroup of B n and \C\ = 2 m . 

We know that the cosets of C in B n partition B n and that the difference 
between any two vectors in the same coset is a codeword. Also, the sum of a 
vector x in B n and any codeword gives another vector in the same coset as x. 

~Pu*ua. 
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After the message is encoded into the full codeword, the codeword c is 
transmitted across the noisy channel. The channel adds to c the error or 
noise word e = (ei,..., e n ), where 


ez 


0 if the channel does not change the fth digit, 
1 if the channel does change the fth digit. 


The received word f is equal to the codeword plus the error word, i.e., r = 
c + e. The set of possible error patterns must be exactly the coset of C in B n 
determined by r. This follows because e = r — c£r + C. Hence, the decoder 
may immediately exclude all error patterns which do not lie in the same coset 
as that determined by r. However, all error patterns in this coset are possible. 
Those error patterns with a smaller number of ones are more probable than 
those with many ones since channel errors are relatively infrequent. An element 
in a coset with the fewest number of ones is called a coset leader. 


Definition 26.1.18 Let ( B m , B n , E , D) be a code. If E(B Tn ), the image of 
B m under E , is a subgroup of B n , then this code is called a linear or group 
code. 

Theorem 26.1.19 Let ( B m , B n , E , D ) be a group code. Then the minimum, 
distance between any two distinct codewords is the least weight of a nonzero 
codeword. 


Proof. Let d be the minimum distance between two distinct codewords. 
Then there exist two distinct codewords u and v in B n such that d(u, v ) = d. 
Let a: be a nonzero codeword such that wt(x) < wt(y) for all nonzero code¬ 
words y. Now d < d(x, 0) = wt(x). Again d - d(u,v) = wt (u + v) > wt(x). 
Hence, d — wt(x). ■ 


Let M be an m x n generator matrix. We now show that the encoding 
function 

E = E m : B 771 —> B n 


defined by E(a ) = aM defines a group code. For this, let us prove that E(B m ) 
is a subgroup of B n . Let 6, c E E(B Tn ). There exist a, d E B m such that 
b = E(a ) — aM and c = E(d) — dM. Therefore, b + c — aM + dM = 
(a + d)M — E(a + d). Thus, b + c E E(B Tn ). Now in B n , c — — c. Hence, 
E(B m ) is a subgroup of B n , proving that E defines a group code. 

A group code can be decoded by the following procedure, known as the 
tabular procedure. We explain this procedure with the help of an example. 

Consider the code (B 3 , H 6 , E, D ) with generator matrix 

“Pi 
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M = 


10 0 111 
0 10 0 11 
0 0 110 1 


We have. 

Elements of B 
000 

, 001 
010 
Oil 
100 
101 
110 
111 


The Set of Codewords Em{B 3 ) 
000 M = 000000 
001M - 001101 
010M = 010011 
011M - 011110 

ioom = loom 

101M - 101010 
110M = 110100 

him = mooi. 


Let C — Em{B 3 ). Then C is a subgroup of B 6 . 

Step 1: List the codewords in a row with 000000 first. 

000000 001101 010011 011110 100111 101010 110100 111001 


Step 2: Choose a word x in B 6 of least weight among those not in the 
previously chosen cosets. Then list the elements of the left coset x + C as the 
next row appearing below a for every a £ C. Let us take x = 100000. 

C : 000000 001101 010011 011110 100111 101010 110100 111001 

x + C: 100000 101101 110011 111110 000111 001010 010100 011001 

Step 3: Repeat step 2 until all elements of B 6 are exhausted. 

Step 4: Decode each received word as the codeword of the column in which 
the received word appears. 

The table obtained by the above process is called the decoding table. 
The decoding table for the above code is shown below: 


c 

000000 

001101 

010011 

011110 

loom 

101010 

110100 

111001 

100000 + c 

100000 

101101 

110011 

111110 

000111 

001010 

010100 

011001 

010000 + c 

010000 

011101 

000011 

001110 

110111 

111010 

100100 

101001 

001000 + c 

001000 

000101 

011011 

010110 

101111 

100010 

111100 

110001 

000100 + c 

000100 

001001 

010111 

011010 

100011 

101110 

110000 

111101 

000010 + c 

000010 

001111 

010001 

011100 

100101 

101000 

110110 

111011 

000001 + c 

000001 

001100 

010010 

011111 

100110 

101011 

110101 

111000 

100001 + c 

100001 

101100 

110010 

111111 

000110 

001011 

010101 

011000 


The x’s chosen are the coset leaders in the coset x + C. Suppose the received 
word is 110011. In the above table, it appears in the third column. Hence, the 
decoder decodes the received word as 010011. We note that a decoding error 
is possible; for if the error pattern was actually 001010, then the codeword 
transmitted was 111001. 
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26.1.1 Worked-Out Exercises 


Exercise 1 Let c € B n . Let q be an integer such that 0 < q < n. Prove that 
there exist Q) elements w 6 B n such that d(w, c) = q. 

Solution: We have that d(w, c) = q if and only if w and c differ in exactly 
q bits. There are exactly (™) ways to change q bits of c. 


Exercise 2 Let C be a set of codewords in B n . Prove that if C can correct k 
errors, then 

\C\< 


2 n 


(S) + © + •■• + ©■ 


Solution: Let N = Q + (”) + ■•■+ © - For all c (= C, let S k (c) = {w € B n 
I d(w,c) < k}. Now the S k (c) are pairwise disjoint and contain N elements. 
Since there are \C\ distinct 5fc(c), JV|C| < \B n \ = 2 n . Thus, \C\ < 


26.1.2 Exercises 

1. Find the weight of each word below: 

(i) 11011010, 

(ii) 11000110. 

2. Find the distance between the following pairs of words: 

(i) 11011011 and 10001010, 

(ii) 11000100 and 00111011. 

3. Let M be an m x n matrix whose submatrix consisting of the first m 
columns is the identity matrix. Let M' be the submatrix of M consisting 
of the last m — n columns of M. Set 



Prove that c is a codeword if and only if cK = 0. 


4. For the matrices M and K of Exercise 3, prove that C is a subgroup of 
B n , where 

C = {c| c£B n , cK = 0}. 

5. Let M be an m x n encoding matrix whose submatrix M" consisting of 
the first m columns of M is invertible. Let M' be as defined in Exercise 
3 and set 

T 

1 n—m 

Prove that c is a codeword if and only if cK = 0. 
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6. Find the set of codewords of Example 26.1.10. 

7. For each of the following generator matrices, find how many errors the 
corresponding code can detect and how many errors it can correct. 

" 1 0 0 1 1 " 

(i) 0 10 0 1 
0 0 110 

' 1 0 0 0 1 0 0 1 ' 

r ..v 0 10 0 110 1 

^ 0 0 1 0 0 1 1 1 ■ 

_ 0 0 0 1 1 0 1 0 ^ 

8. Write the complete coset decoding table for the code given by the gener¬ 
ator matrix 

‘ 1 0 0 1 1 0 ' 

0 10 10 1 . 

0 0 10 11 

From the table, decode the following received words: 

001111, 101010, 011110. 

9. Let C be an (m,n) code. Suppose that each word b € B n with wt(6) < t 
is the coset leader of b + C. Prove that C corrects t or fewer errors. 

10. (i) Show that no (2,4) code can correct single errors. 

(ii) Show that no (3, 6) code can correct two errors. 

11. (i) Construct a (2,5) code that corrects a single error. 

(ii) Construct a (3,6) code that corrects a single error. 

12. Let s — 4. Construct the (11,15)-Hamming code. 

13. Prove Theorem 26.1.15. 

14. Prove Theorem 26.1.16. 

26.2 Polynomial and Cyclic Codes 

In this section, we describe a technique, which encodes m-digit message words 
into n-digit codewords by polynomial multiplication. 

Let a = (ao,ai,..., a m _i) £ B m . Then the correspondence 

a —» ao + a\x -1- • • • + a rn -ix Tn ~ l - a(x) (26.2) 

“Pi~Pt±y.£jA±A. 
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is a one-one mapping of B n into the polynomial ring Z 2 [x]. Set s — n — m. and 
let 


g{x) = b 0 + bix H-h b s x s 

be a polynomial of degree s over Z 2 such that 60 7 ^ 0. Then 
c(x) = Co + Cix 4- • • • 4- c n _ix n_1 = a{x)g(x) 
is a polynomial of degree < n — 1 . Considering the correspondence, 


c = (c 0 ,ci,... ,c n _i) c(x) 
leads us to the following definition. 

Definition 26.2.1 Let n, m be positive integers such that n > m. Let g(x) = 
bo + b\x 4- ■ • • 4- b s x s be any fixed polynomial of Z 2 [ 2 c] of degree s = n — m such 
that bo 7 ^ 0. The encoding polynomial g(x) encodes each m,essa,ge word a G B m : 


which corresponds to a(x) in (26.2) into 
the code polynomial c(x) = a{x)g{x). 


Example 26.2.2 Letm = 3,n = 7, a,nd 
x + x 4 . By considering the product c(x) 
encoding scheme: 



000 -> 0000000 



001 - h . 0011001 



010 -»■0110010 



Oil -> 0101011 


Define thi 

e m.atrix G by 





r b 0 b x 

&2 


G = 

0 

0 

bi 



0 0 

bo 



‘110 

0 


= 

0 1 1 

0 



0 0 1 

1 

Then the 

above encoding scheme can 

be 


aG. 


the codeword c, which corresponds to 

the encoding polynomial be g{x) = 14- 
= a(x)g(x), we obtain the following 

100 1100100 
101 ->1111101 
110 -> 1010110 
111 -► 1001111 

63 64 0 0 

62 ^3 0 

&1 62 63 64 

100" 

0 1 0 

0 0 1 

obtained by the m,a,trix multiplication 


Example 26.2.3 Letm = 3,n = 4, and g(x) — 1 + x. Then the encoding poly¬ 
nomial g(x) encodes message words a G B 3 by the following encoding scheme: 


000 - 

-+ 0000 

100 1100 

001 - 

0011 

101 -> 1111 

010 - 

-> 0110 

110 -> 1010 

011 - 

-+ 0101 

111 -> 1001 


7^o^Lc#fta£uiaZ 
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The encoding matrix is 


G = 


110 0 
0 110 
0 0 11 


The code satisfies an even parity check. 


Example 26.2.4 Letm = 11, n = 15, andg(x) = l+x+x A . Then the encoding 
m,a,trix corresponding to the encoding polynomial g[x) is given by 


G = 


1 1 0 
0 1 1 
0 0 1 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 10 
0 0 1 
1 0 0 
1 1 0 
0 1 1 
0 0 1 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 


0 0 0 
0 0 0 
1 0 0 
0 1 0 
0 0 1 
1 0 0 
1 1 0 
0 1 1 
0 0 1 
0 0 0 
0 0 0 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
1 0 0 
0 1 0 
0 0 1 
1 0 0 
1 1 0 
0 1 1 


0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
0 0 0 
1 0 0 
0 1 0 
0 0 1 


We have for the m.atrix M" defined following Definition 26.1.5 that 


M"~ l 


11110101100 “ 
0 11110 10 110 
0 0 11110 10 11 
0 0 0 1 1 1 1 0 1 0 1 
00001111010 
0 0 0 0 0 1 1 1 1 0 1 


0 0 
0 0 
0 0 
0 0 
0 0 


0 0 
0 0 
0 0 
0 0 
0 0 


0 0 11110 
0 0 0 1 1 1 1 
0 0 0 0 1 1 1 


0 0 0 0 0 1 1 


0 0 0 0 0 0 1 J 

“Pi 7^o^Lc#fta£uiaZ 
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Thus, the parity-check, m.atrix K of Definition 26.1.7 is given by 


K = 


110 0 
0 110 
0 0 11 
110 1 
10 10 
0 10 1 
1110 
0 111 
1111 
10 11 
10 0 1 
10 0 0 
0 10 0 
0 0 10 
0 0 0 1 


The matrix G is the encoding matrix for the (11,15) -Hamming code. 


Theorem 26.2.5 The error polynomial associated, with any undetected error 
vector e — (eo,ei, . ..,e n _i) of a, polynomial ( m,n)-code with generator g{x) 
must be a nontrivial multiple of g{x). 


Proof. If the received word r = (ro,..., r n _i) with the corresponding 

polynomial r(x) = ro+rix-i -|-r n _ix n_1 is erroneous, but undetected. Then 

r is a codeword and so the error e = f — c is a nonzero codeword. Hence e 
must correspond to a nonzero codeword q(x)g{x). ■ 

The division algorithm is very convenient in the detection of errors. Con¬ 
sider a polynomial (m, n)-code with generator g(x). Suppose r{x) is the poly¬ 
nomial corresponding to the received word f. By the division algorithm, there 
exist polynomials q(x) and t(x) such that r(x) = q{x)g{x) + t(x), where either 
t{x) = 0 or deg t[x) < degp(x). If t{x) ^ 0, an error has occurred and we 
deduce that f was not a codeword. 

We now begin our discussion of cyclic codes. 

The cyclic shift of the n-tuple (co, c\,. . ., c n _i) is defined to be the n-tuple 
(cn-ijCo, ci, ..., c n _ 2 )- A linear code is said to be cyclic if the cyclic shift 
of every codeword is again a codeword. Once again we identify the n-tuple 
( cq , Ci,..., c n _i) with the polynomial c(x ) = cq + c\x H-f- c n _ \x n ~ l . 


Theorem 26.2.6 Every linear cyclic (m, n)-code has a, unique m,onic codeword 
g(x) of degree s = n — m, and g(x) divides x n — 1. Let c(x) be of degree n — 1 
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or less. Then c(x) is a codeword if and only if c(x) is a multiple of g{x). 
Also, every monic polynomial of degree s, which divides x n — 1 , is the generator 
polynomial of a linear cyclic (m,n)-code with s = n — m. 

Proof. Consider an arbitrary linear cyclic code and let g{x) of degree s 
be the minimum degree polynomial among the codewords. {g(x) is necessarily 
monic since the coefficients of g(x) lie in Z 2 .) Now xg(x), x 2 g(x ),..., x n ~ s ~ 1 g(x ) 
are just cyclic shifts of g(x) and must be codewords. Since the code is linear, all 
combinations of g(x), xg(x), x 2 g(x),..., x n ~ s ~ 1 g(x), i.e., the 2 n ~ s polynomials 
a(x)g(x), where the degree of a(x) is less than n — s = m, must be codewords. 
But this is all the codewords since if c(x) is any codeword, the division algo¬ 
rithm gives c(x) = a{x)g{x) + r(x), r(x) = 0 or degr(x) < s, a(x) = 0 or 
deg a(x) < n — s. 

Thus, r(x) = c(x) — a(x)g(x), which by the linearity of the code shows 
that r(x) must be a codeword. Since r(x ) = 0 or degr(x) < s, we have by 
the minimality of degg(x) that r(x) — 0. Hence, c(x) is a multiple of g{x). 
Therefore, g{x) is the generator polynomial of the cyclic code. Finally, we note 
that x n ~ s g{x ) — (x n — 1 ) is the cyclic shift of x n_s- 1 p(x) and hence a codeword 
and thus a multiple of g(x), i.e., there is a polynomial a(x) of degree less than 
n — s such that 


x n s g{x) - (x n - 1 ) = a{x)g(x) 


or 


x n - 1 = h(x)g(x), 

where h{x) — x n ~ s — a{x) is a monic polynomial of degree n — s. Thus, g(x) 
divides x n — 1. There are 2 n ~ s codewords, so m = n — s. 

Conversely, let g(x) be a polynomial of degree s which divides x n — 1. If 
the polynomial 

c(x) = Co + C\X +-h C n _lX n_1 

is multiplied by x mod(x n — 1 ), the result is Cn-i + cox + ■ • • + c n -2 xn ~ 1 ■ The 
codeword represented by the polynomial xc(x)mod(x n — 1 ) is seen to be a cyclic 
shift of the codeword represented by the polynomial c(x). Since every cyclic 
shift of a codeword therefore gives another codeword, the code is a cyclic code. 
Hence, the set of multiples of g(x)mod(x n — 1) forms a linear cyclic code. ■ 

Example 26.2.7 Consider the linear binary cyclic code with the codeword set 

{ 000 , 011 , 101 , 110 }. 

The codeword 110 corresponds to the polynomial 1 + x, which is the minimum. 
degree codeword polynomial. Hence, this cyclic code is the length n = 3 cyclic 
code generated by g(x) = 1 + x. 





26.2. POLYNOMIAL AND CYCLIC CODES 


563 


Example 26.2.8 Choose the generator polynomial of a, binary cyclic code to 
be g{x ) = 1 + x + x 3 . It is readily checked, that g(x) divides x 7 — 1 so that we 
m,ay choose n — 7. The number of information digits in this code is m = 7 — 3 
= 4. The 2 m = 2 4 codewords are the 7-tuples corresponding to the polynomials 

a(x)g{x) — (a 0 + a\X + a^x 2 + a3X 3 )(l + x + x 3 ). 

This code is the second, in the class of Hamming single-error-correcting codes. 
(There is a, code in this class with n = 2 l — 1 and m = 2 1 — i — 1 for i = 2,3,4, 

....) 

Example 26.2.9 Choose the generator polynomial of a binary cyclic code to 
be g(x ) = 1 + x + x 4 . It is readily verified that g{x) divides x 15 - 1 so that 
we may choose n = 15. The number of information digits in this code is m 
= 15 — 4 = 11. This code is the third in the class of Hamming single-error- 
correcting cyclic codes. 

The representation of codewords by the polynomials modulo x n — 1 in the 
proof of Theorem 26.2.6 suggests that we could have introduced cyclic codes 
by means of ideals of commutative rings. Consider the polynomial ring Z2(x], 
the ideal generated by ( x n — 1) , and the quotient ring 

B n = Z 2 [x]/ (x n — 1). 


Then 


B n — {ao + a\x H-b a n _ix n 1 | a; £ Z2, i = 0,1 ,..., n — 1}, 

where x denotes the coset x + (x n — 1) in B n . Let C C B n be a cyclic code 
and fix) = ao + aix + ■ ■ • + a n ~ ix 71-1 G C. Since x 77 = 1 in B n , we see that 
x/(x) = o n _ 1 + aox + afx 1 + • ■ ■ + a n _2X n_1 is the cyclic shift of /(x). Since 
C is cyclic, x/(x) £ C. Therefore, g{x)f(x) £ C for any /(x) £ C and any 
g(x) £ B n . Hence, C is an ideal of B n . Clearly if C is an ideal of B n , then 
x/(x) £ C for all f(x) £ C. Thus, we have shown the following result. 


Theorem 26.2.10 LetC C B n be a linear code. Then the following conditions 
are equivalent. 

(i) C is cyclic. 

(ii) xC C C. 

(Hi) C is an ideal of B n . M 


Now B n is the homomorphic image of the principal ideal ring Z2[x] and 
so B n is a principal ideal ring. Thus, if C is an ideal of B n , there exists 
g(x) £ Z2 [x] such that (g(x)) = C. The polynomial g(x) has special properties 
which we describe in the exercises, and which can be seen by Theorem 26.2.6. 
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We now determine the dimension of a cyclic code C, where C is considered 
to be a subspace of the vector space B n over Z 2 . Let C = (g(x)) , where g(x) 
divides x n — 1. We recall that if R is a commutative ring and a £ R, then the 
annihilator of a, ann(a) — {r £ R \ ra — 0}, is an ideal of R. 

Lemma 26.2.11 Let g(x ) € Z 2 [ 2 ^] divide x n — 1. Then in B n , 

ann(g(x )) = (h(x)) , 


where x n — 1 = h(x)g{x). 


Proof. Since 0 = x n — 1 = h(x)g{x), h(x ) G ann(g(x)). Thus, 

(/i(x)) C ann(p(x)). 

Let /(x) € ann(g(x)). Then f(x)g(x) = 0. Therefore, f{x)g{x) £ Ker rj - = 
(x n — 1) , where rj is the natural homomorphism of Z 2 [x] onto Z 2 [x]/ ( x n — 1) = 
B n . Thus, there exists q[x) £ Z 2 [x] such that f(x)g(x) = g(x)(x n — 1). There¬ 
fore, f(x)g(x) = q(x)h{x)g{x) and so /(x) = q(x)h(x). Thus, 

f(x) = q(x)h(x) G (h(x)) . 

Hence, ann (g(x)) C { h,(x )}. Consequently, ann(p(i)) = (h(x)) . ■ 

Theorem 26.2.12 Let g(x) G Z 2 [a^] divide x n — 1. Let C ~ (g(x)) be a 
cyclic code in B n . Let s = deg g(x) and m = n — s. Then X = { g(x ), 
xg(x ),..., x m ~ 1 g(x)} is a basis of C over Z 2 [x], 


Proof. Since g(x) divides x n — 1, there exists h(x) G Z 2 [x] such that x n — l = 
h{x)g{x). We show that X spans C. Let /(x) G C. Then fix) — k(x)g(x) 
for some k{x) G Z 2 [x]. By the division algorithm, there exists q(x),r(x) G 
Z 2 [ x] such that k{x) = q{x)h{x) + r(x), where either r(x) = 0 or degr(x) < 
degh(x) = m. By Lemma 26.2.11, h(x)g(x) = 0. Thus, 


f(x) = (q(x)h(x) + r(x))g{x) = r(x)g(x) 


and degr(x) < m — 1. Hence, /(x) is a linear combination over Z 2 of the 
elements of X. Therefore, X spans C. Suppose 0 = aop(x) + a\xg{x) + • • • + 
a m _ix m_1 g(x), where a* G Z 2 , i = 0,1,... , m — 1. Let f(x) = ao + a\x + 

-ha m _ix 771 " 1 . Then 0 = f(x)g(x). Hence, f(x) G ann(g(x)) — {h(x )). Thus, 

{/(x)) C (h(x)) and so h(x)\f(x) by Exercise 5. Now deg /(x) < m — 1 < 
m = degh(x). Consequently, f(x) -- 0. Hence, ao = a\ = • • • = a m _i — 0. 
Therefore, X is linearly independent over Z 2 . ■ 
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Corollary 26.2.13 Let g(x) £ Z2 [^c] divide x n — 1. Let C = (g(x)) be a cyclic 
code in B n . Then C has dimension m over Z2. Fur therm,ore, C is an (n,m)- 
code. ■ 


We recall that if C — ( g{x )) is a cyclic code in B n , then 


go 

g 1 

gi 

g s 

0 

0 

0 

20 

g\ 

g s -i 

g s 

0 

0 

0 

. . . 

go g\ 

g 2 

g s 


(26.3) 


is a generator matrix for C, where g(x ) = go+g\x+- ■ -+g s x s . We now determine 
a parity-check matrix H for C. We have that x n — 1 = h(x)g(x) for some 

h{x) £ Z2 [x ], where deg h{x) — n — s = m. Write h(x) = ho~\-hix-\ -1- h m x m . 

Define H to be the n x s matrix, 


' 0 

0 

hm 

0 

hm 

hm—(s — 2 ) 

b j rn 

hm— 1 

hm~(s — 1 ) 

h $—1 

h $—2 

ho 

^2 

hi 

0 

hi 

ho 

0 

ho 

0 

0 


(26.4) 


Lemma 26.2.14 Let G and H be defined as above. Then GH = 0. 


Proof. The (i,j)th component of the matrix GH is given by 

n—1 

'y ^ ffn—t+fc+1 h'n—j—k ) (26.5) 

k =0 

where g s +i = ■ ■ ■ = g n -i — 0 and h m - |_i = ■ • ■ = h n -\ — 0 and where the 
subscripts of the g n - l+ k+i and the are each taken modulo n. Since 

g(x)h(x) = 0, we have 


go^t + • • • + gkLt-k + • ■ • + gtho — 0 

for k = 0,1,..., t; t — 0,1,..., n — 1. Hence, if we take the subscripts in Eq. 

(26.5), 

gihj+n -1 + ■ • ■ + gi+khj+n-k-l + • • * + gi+n-lhj, 
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modulo n, we have 

m—1 

^ y 9i+khj+n—k— 1 = 0 - 
k =0 

Thus, GH — 0. I 

It seems advisable to give an example illustrating Lemma 26.2.14 before 
stating our next result. 

Example 26.2.15 Letn = 7 and g{x) — l+x+x 3 . Then h{x) = l+x-\-x 2 +x A . 

Thus, s — 3 and m = 4. We have that 

' 1 1 0 1 0 0 0 ' 

0 110 10 0 

G_ 0011010 

0 0 0 1 1 0 1 

L J 

and 

' 0 0 1 ' 

0 1 0 

1 0 0 

H= Oil. 

1 1 1 

1 1 0 

1 0 0 

Now GH is a 4 X 3 m,atrix and the (3,2) entry of GH is 

95L5 + < 76^4 + 9763 + gahv + <79^1 + £710^0 + £ 7 n^-i 
— £ 75^-5 + £ 76^-4 + goh3 + g\h2 + £ 72^-1 + # 3^0 + g^he 

(taking subscripts rrwdulo 7) 

= 90^3 + 9lh 2 + £72^1 + 53^0 

since g A = £75 = g 6 = 0 = h 5 = h 6 . Now g 0 h 3 + £ 7^2 + £ 72^1 + 93^0 = 0 since 
g(x)h(x) = 0. 

In the following theorem, we show that the matrix H given by Eq. (26.4) 
is a parity check matrix of the cyclic code which is generated by the matrix G 
of Eq. (26.3). 

Theorem 26.2.16 Let C = ( g(x)) be a cyclic code in B n , where g{x)h{x) = 
x n — 1 and deg g{x) = s > 1. Let G and H be the m, a trices given in Eqs. (26.3) 
and (26.4). Let D = {r(x) 6 B n \ rH — 0}, where r(x) = rfx 1 and 

r = r 0 ri • • ■ r n _ 1 for G Z 2 , i = 0,1,..., n — 1. Then D = C. 
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Proof. By Lemma 26.2.14, D C C. By Theorem 26.2.12, dimC = m. 
Hence, it suffices to show that dimD = m. Since g{x)h{x) = x n — 1, h m = 1, 
where h(x) = YiL o^x 1 . Thus, the s columns of H are linearly independent. 
Therefore, if we let Do denote the subspace of B n spanned by these columns, 
then dimDo = s. From linear algebra, we recall that the orthogonal complement 
Dq - {r(x) E B n | r o t = 0, t(x) E Do} has dimension m — n — s. However, 
Do — D and so dim D = m ,, the desired conclusion. ■ 


Example 26.2.17 Let n = 4. It follows that 1 — x 4 = (1 + x)(l + x + x 2 +x 3 ). 
Let g(x) = 1 + x. Then 5 = 1 and m, = 4—1 = 3. Hence, h(x ) - 1 + x + x 2 +x 3 . 


Thus, 


G = 


110 0 
0 110 
0 0 11 


and 


H = 


1 

1 

1 

1 


We recall that this is the (4,3) code in Exam,pie 26.2.7. 


We now examine error detection and correction for cyclic codes. We know 
from Theorem 26.1.16 that for a code to detect all sets of k or fewer errors, it is 
necessary and sufficient that the minimum distance between any two codewords 
be at least k + 1, and for a code to correct all sets of k or fewer errors, it is 
necessary that the minimum distance between codewords be at least 2k + 1. 
Theorem 26.1.19 says that the minimum distance between any two distinct 
codewords is the least weight of a nonzero codeword. 

Let Fbea field containing Z 2 and a E F be a primitive nth root of unity 
over Z 2 , that is, a has order n in the group (F\{ 0}, •}. Thus, 1, a,..., a n_1 are 
distinct. Hence, n is odd else n — 2k for some k and so ( a k — l) 2 = a n — 1 = 0. In 
this case, a k — 1 = 0, which contradicts the fact that 1 , a, ..., a n ~ l are distinct. 
Since a E F, a 2 ,..., a n_1 E F. Therefore, 

x n — 1 = {x — 1 ){x — a)(x ~ a 2 ) ■ ■ • (x — a 71-1 ) over F. 

Let g{x) E Z 2 [z] be any polynomial which divides x n — 1. Then the set of all 
roots of g(x) is a subset of {1, a,..., a n-1 }. 


Theorem 26.2.18 Let a he a primitive nth root of unity over Z 2 , n > 1. Let 

C ~ (s(z)> 

he a cyclic code in B n . Let u,v be integers such that l<u<v<n — 1, 
a u , a u+1 ,..., a v are roots of g(x). Then d > v — u + 2, where d is the minimum, 
distance of the code C. 
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Proof. Let t — v — uPl and f(x) = aQ + a\x-\ - \-a n -\x n ~ l £ C. It suffices 

to show that wt(a) > t + 1, where a = ao^i • • • a n -i- There exists q{x ) £ Z 2 
such that f(x ) = q{x)g(x). Thus, fipl) = q(a l )g(a l ) = 0 for i = u, u + 1,..., v. 
Consequently, 




1 

1 

1 



a u 


a u+t -1 

[aocii - 

‘ ^71—l] 

a 2u 

a 2(u+l) 

a 2N+*-i) 



a (n-l) u 

a (n-l)(u+l) 

a (n-l)(u+i-l) 


[00 - - * 0 ]. 


Suppose that wt(a) = s < t + 1. Then exactly s of the a* = 1, say, aq = 
ai s = 1, where i\ < • • ■ < i s . Now 


a is\ 

' a nu 

a l2U 

a n(u+l) 

a i2 C+ 1 ) 

a n(u+S“l) 
a i 2 (u+s- 1) 


a isU 

a b(u+l) 

^(■U+S-l) 


[0 0 • • • 0 ]. 


Thus, 


0 = a ^-H2u+-+t.«de t 


1 a* 1 
1 a i2 


(a^y - 1 

(a i2 ) s ~ l 


1 a is (a is ) s ~ l 


However, this is impossible since the a n , a* 2 ,..., a ls are distinct and the deter¬ 
minant is a Vandermonde determinate. ■ 


We now consider only binary cyclic codes of odd length n. Then x n — 1 has 
distinct factors. The factors are generators of binary cyclic codes. However, it 
can be difficult to find the factors. Fortunately, an ideal can have more that 
one generator. We will be interested in certain kinds of generators since there 
is a method of determining them. 

A generator e(x) of an ideal in B n is called an idempotent generator if 
it is an idempotent, that is, if e 2 (x) = e(x). Note that if a(x) is in ( e(x )), then 
a(x) = b(x)e(x) for some b{x ) and so a(x)e(x) = b(x)e 2 (x) = b(x)e(x ) = a(x). 
That is, e(x ) is an identity of (e(x)) . Conversely, an idempotent that is an 
identity for an ideal I, generates I. This follows from the following argument. 

Suppose e(x) is an identity for an ideal I in B n . Then for all a(x ) £ /, 
a(x ) = a(x)e(x). Thus, I C { e(x )). Since e(x) £ I, (e(x )) C I. Hence, I = 
(e(z)). 

In order to determine how we find idempotent generators, we introduce the 
notion of a cyclotomic coset. 
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Consider any integer s such that 0 < s < p™ — 1 , where p is a prime and 
let r be the smallest nonnegative integer such that p r+1 s = s(mod(p m — 1 )). 
The cyclotomic coset containing s consists of {s,ps,p 2 s,... ,p r s}, where each 
p l s is reduced modQ ? 771 — 1). If gcd(s,p m — 1) = 1, then r — m — 1, but if 
gcd(s,p m — 1) 7 ^ 1, then r varies with s. Note ( p m — l)](p r+1 s — s) and so 
( p m — l)|(p r+1 — l)s. Even though cyclotomic cosets are not cosets of a group, 
they partition the integers mod ( p m — 1 ), that is, each integer mod ( p Tn — 1 ) 
is in exactly one coset. If u is the smallest element in its cyclotomic coset, we 
denote the coset by C u . 

Let m = 4 and p = 2. Let s — 0. Then (2 4 — 1)|(2 0+1 0 — 0) and so r = 0. 
Hence, Cq = {0}. Let s = 1 . Then ( 2 4 — l)|(2 r+1 1 — 1) and so r = 3. Thus, C\ — 
{1, 2,4, 8}. Let s = 3. Then (2 4 - l)|(2 r+1 3 - 3), i.e., (2 4 - l)|(2 r+1 - 1)3. Now 
(2 4 — 1 ) /(2 0+1 1 — 1)3, (2 4 —1) /(2 1+1 1 —1)3, (2 4 —1) does not divide (2 2 + 1 l-l)3, 
but (2 4 — 1)|(2 3+1 1 — 1)3. Therefore, r = 3. Consequently, C 3 = {3, 6 ,12, 24} = 
(3, 6 ,12,9}. Let s = 5. Then (2 4 — l)|(2 r+1 5 — 5). Now (2 4 — 1) does not divide 
(2 0+1 l - 1)5, but (2 4 - 1 )|( 2 1+1 1 - 1)5. Thus, r = 1. Hence, C 5 = {5,10}. 
Note that C 0 = {0}, C x - (1,2,4,8}, C 3 - {3,6,12,9}, C 5 = {5,10}, and 
C 7 = {7,14,13,11} partition {0,1,2,..., 14}. 

We now illustrate how we determine idempotent generators. Let n = 7. 
Suppose we have e(x) = ao + a x x + • —h a^x 6 . Now the coefficient of e 2 (x) for 
x 1 is 

i 

Xi 0>j a i—j , 

3=0 

where i — 0,1,..., 12 and we have ai = 0 for i = 7,..., 12. Hence, 


cijCLi^j 

3 -0 


0 if i is odd 
a/*i 2 if i is even. 


For example, ao a O — a o> a 0 a i + a i a 0 — 0, ao a 2 + aia-i + G 2&0 — a \i an d 
aoa .3 + axa 2 + 0201 + a 3 ao = 0. Now 0 is the coefficient of x 7 , x 9 , x 11 and 
x 8 = x, x 10 = x 3 , and x 12 = x 5 . Thus, 


e 2 (x) = Qq + a\x + a 2 x 2 + a^x 3 + a\x A + agX 5 + a 2 x 6 = e(x). 


Hence, do = a 0 ) a i = a A, a 2 ~ a i) a 3 = a 5 > a 4 — 02 ? a 5 — a 6> and ag = a 3 or 
ag = ao, ai — d 2 — 04 , a 3 = as ~ ag. Note that modulo 7, we have 2-0 = 0, 
2-1 = 2, 2-2 = 4, 2-4 = 1, 2-3 = 6 , 2-6 = 5. Thus, we see that this can 
only happen if S' is a union of cyclotomic cosets for n = 7, where S is the set 
of powers of x that occurs with nonzero coefficients in e(x). The proof of the 
following result follows by a similar argument. 


Lemma 26.2.19 Let f{x) £ B n . Then /(x) is an idempotent in B n if and 
only if the set S of powers of x that occur with nonzero coefficients in /(x) is 
a union of cyclotomic cosets. 
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We now determine the cyclotomic cosets for n = 7 = 2 3 — 1, i.e., m = 3 
and p = 2. Let s = 0, then 7|(2 o+ °0 — 0) and so r = 0. Thus, Co — {0}. Let 
s = 1. Then 7|(2 2+1 1 — 1) and so r = 2. Hence, C\ = {1,2,4}. Let s = 3. Then 
7|(2 2+1 3 — 3) and so r = 2. Thus, Co = {3,6,12} = {3,6,5}. The cyclotomic 
cosets yield the eight idempotents. We list them below. 


Idempotent generator 

e\(x) = 1 + x + x 2 + x 3 + x A + x 5 + x 6 ; Co U C\ U Co 

e 2 (x) = 1 + x 3 + x 5 + x 6 ; Co U Co 

6o{x) = 1 + x + x 2 + xC q U C\ 

e 2 (x) + eo(x) = x + x 2 + x 3 + x 4 + x 5 + x 6 ; C\ U Co 

e\(x) + e 2 (x) = x x 2 + x 4 ; C\ 

e\{x) + eo(x) = x 3 + x 5 + x 6 ; Co- 


The two remaining idempotents are 0 and 1. Now 1 generates the whole 
space, while 0 generates the zero space. We see every code of length 7 has an 
idempotent generator. 

Theorem 26.2.20 Every cyclic code has an idempotent generator. 


Proof. Let g(x) be the generator polynomial of the cyclic code C. Then 
x n — 1 = g{x)h{x) for some h[x). Since x n — 1 has distinct factors, g(x ) and 
h(x ) are relatively prime. Thus, there exist s(a:), t(x) £ Z 2 such that 

1 = s(x)g(x) + t(x)h(x). (26.6) 

Let e(x) = s(x)g(x). Then e(x) £ C. We multiply Eq. (26.6) by s{x)g{x) to 
obtain s(x)g(x) = s 2 (x)g 2 (x) + s(x)g(x)t(x)h(x) = s 2 (x)g 2 (x) + s(x)t(x)(x n — 
1). Hence, e(x) = e 2 (x ) -f 0 in B n . Thus, e(x) is idempotent. Let c(x ) £ C. 
Then c{x) = r{x)g{x) for some r(x) in B n . Now multiply Eq. (26.6) by c{x). 
We obtain c( x) = s(x)g(x)c{x) + t{x)c{x)h{x). Thus, c(x) = s(x)g(x)c(x) = 
e(x)c(x). Therefore, e(x) is an identity for C. Consequently, e{x) generates 

c.m 


26.2.1 Exercises 

1. Determine the number of cyclic codes of the following lengths: 

(i) length 6, 

(ii) length 7, 

(iii) length 10. 

2. Let m, = 3, n = 7 and the generator polynomial be g{x) = 1 + x + x 2 + 
x 3 + x 4 . Determine the corresponding (3,7)-code and give the encoding 
matrix G. 
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3. In Exercise 2 , find the parity-check matrix K. 

4. Show that the codes in Examples 26.2.8 and 26.-2.~9 are Hamming codes. 

v; 

5. Let f(x) and h(x) E Z 2 [a:] be such that h(x) divides x n — 1 in Z 2 [x].. 
Prove that {fix)) C (h(x)) if and only if h(x) divides f(x) in Z 2 [^c] - 

6 . Let C be an ideal of B n . Prove the following assertions. 

(i) There exists an ideal I of Z 2 [^c] such that C — T](I) and / D Ker 77 , 
where 77 is the natural homomorphism of Z 2 [:r] onto B n . 

(ii) If C = (g{ x )) , then I = ( g{x )) , where I is the ideal in (i). 

(iii) If g{x) E Z 2 [-^] is such that I = (g{x )), then C = (g(x)) and 
g(x)\(x n - 1 ). 

7. Let C be a cyclic code in B n . Let g{x) E Z 2 [m] be the smallest degree 
polynomial such that g(x) E C. Prove that {g{x)) — C and that g{x) is 
unique. 


26.3 Bose-Chauduri-Hocquenghem Codes 

In this section, we take a very brief look at Bose-Chauduri-Hocquenghen codes 
(BCH codes). For codewords of length several thousand, these codes preform 
very well. BCH codes are multiple-error-correcting codes. The number of check 
digits is a function of the number of errors to be detected or corrected. In the 
following, we give a systematic way to construct binary BCH codes of any 
length. 

Since we only consider binary BCH codes here, our symbols are once again 
from Z 2 ■ Two words are said to have distance d if they differ in d places. We 
wish to construct a code with minimum distance d , i.e., the distance between 
two codewords is at least d. 

Definition 26.3.1 Let a be a primitive nth root of unity over Z 2 . Let mi(x ) 
be the minim,um, polynomial of a 1 over Z 2 , i — 1,... , 77 , — 1. Let d and, u be 
integers, where d > 2 a.nd, u > 0. If 

g{x) = \cm(m, u (x),m u+1 (x)... ,m u+d _ 2 (x)), 

then the cyclic code {g(x )) in B n is called a binary BCH code of length n 
and, distance d. 

Since the polynomials m u (x), m u+ \(x) ..., m, u+d _2(x) are irreducible over 
Z 2 , g(x) is the product of the distinct m d (x), i = u, u + 1 , ..., u + d — 2 . 
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Theorem 26.3.2 Let C — (g(x)) be the binary BCH code of length n and 
distance d. Then the following assertions hold. 

(i) The minimum distance of C is at least d. 

(ii) f(x) £ C if and only if /(a*) = 0 for i — u, u + 1,... ,u + d — 2. 

(Hi) A parity-check m,a,trix for C is given by 


H = 


1 

1 

1 

a u 

a u+1 

a u+d ~ 2 

a 2u 

a 2(u+l) 

a 2(u+d-2) 

(n—l)u 

a (n-l)(u+l) 

a (n-l)(u+d-2) 



Proof. (i) Since m^(x) divides g(x ), a 1 is a root of g(x), for i = u,u + 
1, ..., u + d — 2 . Thus, the desired result follows from Theorem 26.2.18. 

(ii) Let f(x) £ C. Since g(x) divides f(x) and each a L is a root of g(x) by 
(i), each a 1 is a root of f(x). Conversely, suppose that a 1 is a root of f(x) for 
i = u,u+l,...,u + d — 2. Then m^(x) divides f{x) since m,i(x) is the minimal 
polynomial of a 1 , i = u, u + 1, ..., u + d — 2. Therefore, g{x) divides fix) since 
g(x) is the product of the distinct m^a:), which are relatively prime. Thus, 
f(x)eC. 

(iii) Let f(x) = ao + o-ix + • • • + an-fx 11 1 € B n . Then 

a/f = /(a“)/(a“ +1 )---/(a“ + ' 1 - 2 ), 

where a = aoa-i ■ • • a n _i- Thus, dH — 0 if and only if f(o, u ) — f(a u+1 ) — • ■ = 
f(a u+d ~ 2 ) = 0 if and only if f(x) £ C by (ii). ■ 

Example 26.3.3 We construct a binary BCH code with codeword length n — 
15, which has minimum, distance d — 5. Let a be a root of the primitive poly¬ 
nomial 1 + x + :r 4 . Consider the successive powers of a 


a 2 


a 9 

= a 3 

+ 

a 


a 3 


a 19 

= a 2 

+ 

a T 1 

a 4 

— a T 1 

a 11 

- a 3 

+ 

a 2 

+ a 

a 5 

— a 2 + a 

a 12 

- a 3 

+ 

a 2 

+ a + 1 

a 6 

- a 3 + a 2 

a 13 

- a 3 


a 2 

+ 1 

a 7 

= a 3 + a + 1 

a 14 

= a 3 

+ 

1 


a 8 

= a 2 + 1 

a 15 

= 1 





Let m,i{x) denote the minimum, polynomial of a 1 , i = 1,2,3,4. It is easily 
verified that o, a 2 , a 4 , a 8 a,re the roots of 1 + x + x 4 and that a 3 , a 6 , a 9 , a 12 are 
the roots of 1 + x + x 2 + x 3 + z 4 . Thus, 

m\{x) = 777.2(2:) = 7774(2:) = 1 4- x + x 4 
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and 

m^(x) = l + x + x 2 +x 3 + x 4 . 

Hence, 

/cm(mi(x),m2(x),m3(x),7n 4 (x)) = lcm(mi(x) i m^(x)). 

Since m\{x) and mz(x) have no common roots, lcm(m,\(x), 7713 ( 1 )) is of degree 
at least 8. Hence, 

lcm { m ,\( x ), m3(x)) = mi(x)m.3(x) = 1 + x 4 + x 6 -f x 7 + x 8 . 

Since degg(x) = 8, this BCH code has 15 — 8 = 7 information digits. This code 
detects all sets of 5 — 1 =4 or fewer errors and, corrects all sets of (5 — l)/2 = 2 
or fewer errors. 

26.3.1 Exercises 

1. Show that BCH codes are cyclic. 

2. Show that the polynomial 1 + x + x 4 is irreducible over Z2. 

3. Let F be a finite field such that F D Z 2 . Let mi(x) be the minimal 
polynomial of c 1 over Z2, where F* = (c) . Prove that mAx) divides 
x 2 "- 1 - 1, where 2 n - |F|. 
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27.1 Affine Varieties 

This chapter is concerned with the geometry dealing with affine varieties. An 
affine variety is defined by polynomial equations. These polynomial equations 
may define, for example, curves and surfaces. Throughout this chapter, we 
let K[x i,..., x n ] denote the polynomial ring in the algebraically independent 
indeterminates x\,. .., x n over the field K. 

Definition 27.1.1 A product of the form, x^ 1 •••x“ n , where cq,...,a: n are 
nonnegative integers, is called a monomial in x i,..., x n . The sum, ot\ + • —f- 
a n is called the total degree of the m,onom,ial. 

Let x^ 1 • • • x% n be a monomial. Then we simply write x a for x“ x • • ■ x“ n , 
where we let a = (cq,..., a n ). If a = 0, then x a = 1. We sometimes write |a| 
for ai H-f- OL n . 

Definition 27.1.2 Let f — J^ a a Q x a £ K[x i,...,x n ]. Then a Q is called the 
coefficient of the m,onomial x a a,nd a a x a is called a term of f if a Q ^ 0. The 
total degree of f, denoted by deg(/), is the largest |a:| for which a a ^ 0. 

Definition 27.1.3 Let n be a positive integer. The set 

K n = {(ai,... ,a n ) | £ K, i = 1,... ,n} 

is called the affine space over K. 

For / £ K[x i,..., x n ], we can interpret / as a function from K n into K as 
follows: For all (ai,..., a n ) £ K n , 

/((ai,...,a n )) - 

a 

where / = £ a a Q xf } ■ • • x" n . 
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Theorem 27.1.4 Suppose that K is infinite. Let f £ K[x i,..., x n ]. Then / 
= 0 m iffaq,..., x n ] if and only if f : K n K is the zero function. 

Proof. If / = 0, then clearly / : K n —> K is the zero function. Conversely, 
suppose that / is the zero function. The proof is by induction on n, the number 
of indeterminates. Suppose that n = 1. Then by Theorem 14.1.11, / has at 
most m roots, where m is the degree of /. Since /(a) =0 for all a £ K and since 
K is infinite, it follows that f = 0. Now assume that the converse is true for 
n — 1. Let / £ K[x i,..., x n ] be such that f(a\, ..., a n ) = 0 for all (ai,..., a n ) 
£ K n . Now we can express / in the form 

<? 

/ = 1 )<, 

where gi £ K[x i,..., £ n -i]> * = 0,1, • • •, q- Consider any arbitrary fixed (ai,..., 
a n _i) £ K n ~ l . Then /(ai,..., a n _i,x n ) is a polynomial in one indeterminate. 
By the n = 1 case, /(ai,,...,a n _i,x n ) is the zero polynomial in K[x n ] since 
/(ai,..., a n ) = 0 for all a n £ K. Hence, gfiai ,..., a n _i) = 0 for i = 0,1, ..., 
q. Since (ai,... ,a n _i) is arbitrary, it follows by the induction hypothesis that 
each gi is the zero polynomial in K[x i,..., x n _i]. Thus, / is the zero polynomial 
in K[x\,.. ., x n \. ■ 

Corollary 27.1.5 Suppose that K is infinite. Let f,gE K[x i,... ,x n ]. Then 
f = g if and only if f : K n —► K and g : K n —> K are the same function. 


Proof. The proof follows from Theorem 27.1.4 by considering f — g. M 

Definition 27.1.6 Let /i,..., / m £ K[x \ t ..., x n \. The set 

V(fi ,... ,/m) = {(ai ,... ,a n ) E K n \ fi(a u ... ,a n ) = 0 for all i = l,...,m} 

is called the affine variety defined by /i,..., / m . 

We sometimes use the notation V({fi | z = 1,2,..., m}) for P(/i,..., f m ). 
Consider, for example, the following linear system of equations 

x + 2y + z = 2 

x + y — z = 1. 

We replace the second equation by the second equation minus the first 
equation to obtain 

x + 2 y f z — 2 

—y — 2z = —1. 
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We then replace the first equation by the first equation plus two times the 
second equation. We then have 


x — 3z = 0 

-y - 2 z = -1. 


Thus, 


V(x -\-2y + z — 2,x + y — z — 1) = {(3 1 ,1 — 2 t,t) \ t G K}. 


The method used to solve the above system of equations was that of elimination 
of variables. The equations x + 2y + z — 2 = 0 and x+ y —z — 1=0 form what 
is called an implicit representation of V. 

For an application of polynomial equations, we turn to robotics. We con¬ 
sider the motion of a robot’s arm in the plane. We assume that we have three 
linked rods of lengths 6, 4, 2, respectively. 



The positions or states of the arm are determined by the solution in R 6 to 
the following polynomial equations. 


x 2 4- y 2 = 36 
(z — x) 2 + (w — y) 2 = 16 

(u — z) 2 + (v — w) 2 — 4. 


Another application to polynomial equations is in automatic geometric the¬ 
orem proving. We introduce Cartesian coordinates in the Euclidean plane. 
Having done this, many geometric theorems can be expressed as polynomial 
equations. We show, for example, how polynomial equations can be used to de¬ 
termine results concerning the diagonals of a square. Let A, B , C, D be vertices 
of a square. 


~Pu*uc- 


27.1. AFFINE VARIETIES 


577 



Let X denote the point of intersection of the diagonals AC and BD. We 
place side AB on the x-axis with vertex A at the origin. Then the Cartesian 
coordinates of A are (0,0) and those of B are (a,0), where a is arbitrary. The 
Cartesian coordinates of C and D are determined by B. We write (x 1 , 2 / 1 ) and 
(•^ 2 , 2 / 2 ) f°r the Cartesian coordinates of C and D, respectively. We use the 
slope formula for a line segment to translate the defining properties of a square 
into polynomial equations. 

ABLAD : X2 = 0 

AB\\CD : 0 = ( 2/2 - yi)/(x2 ~ x x ) 

\AB\ — \DC\ : a 2 = (x 2 — xi ) 2 + ( 2/2 ~ Vi ) 2 
\AB\ = \AD\ : y 2 = ±a. 

Thus, we obtain the polynomial equations, 

V\ - V2 = 0 

A + {V2 - Vi ) 2 - a 2 = 0 . 

(Hence, xi = ±a and 2/1 = ± a.) 

We also know that A, X, C are collinear, as are D, X, B. Thus, if we let 
(X 3 , 2 / 3 ) denote the Cartesian coordinates of X, we have the following equations, 

A , X, C are collinear : 2 / 3/23 = ( 2/3 - yi)/(xs - x i ) 

D, X, B are collinear : ( 2/3 - 2 / 2)/(23 ~ 2 2 ) = ( 2/3 - 0)/ (m 3 - a). 

We hence obtain the polynomial equations, 


x m ~ 232/1 = 0 
222/3 - 232/2 - ay 3 + ay 2 = 0 . 


(27.2) 


(Thus, x 3 = a/2 = 2/3-) 

Consider the property that the diagonals of a square intersect in right an¬ 
gles. 

AX 2 + YD 2 = AD 2 : (0 - x 3 ) 2 + (0 - j/ 3 ) 2 + (a* - s 2 ) 2 + (jft - ») 2 

= (0 — s 2 ) 2 + (0 — 
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or 


A + vl ~ *2X3 ~ V2V3 = 0. (27.3) 

Therefore, the statement which says that the diagonals of a square intersect 
in right angles translates into the statement that Eqs. (27.1) and (27.2) imply 
the Eq. (27.3). There are, of course, many other conclusions we could derive. 

Lemma 27.1.7 Let V, W C K n be affine varieties. Then V U W and VC\ W 
are affine varieties. 

Proof. Let V = V(/i,...,/ m ) and W = V(gi,...,g q ) for some 
gi,...,g q e K[x 1, ...,x n }. Now (a 1; ... ,a n ) € VDW if and only if fiffii ,... ,a n ) 
= 0 and gj ( a \,..., a n ) — 0 for all -i = 1,..., m and j = 1,..., q if and only if 

(ai,.. . ,a n ) E V(fi > • ■ • > fmi 9li • • • 1 9q) • 

Thus, V DW = E(/i,..-,/m, gi, ...,g q ). 

Let (a 1; ..., a n ) E V. Then /;(ai,..., a n )gj(a 1,..., a n ) = 0 for all i = 
and j = 1, ...,g. Hence, V C E({/^- | i = 1,..., m ; j = 1,..., <?}) 
and similarly W C V ({fcgj \ 1,..., m ; j = 1,..., q}). Thus, VUW CV ({fcgj 
| i = 1 ,...,m ; j = l,...,g}). Let (a!,...,a n ) E E({/^ | i = 

; j — 1,..., <?}). Suppose there exists i such that fiffii,... ,a n ) 7^ 0. Since 

/»(oi,..., a n )5j(ai,..., a n ) = 0, we have that gj(a 1,..., a„) — 0 for all j 
— 1,... ,q. Therefore, (ai,..., o n ) E W. Suppose ... ,a n ) = 0 for all i = 
. Then (ai,...,a n ) E V. Thus, V({figj | i = 1,..., m ; j = 1,..., g}) 

C V U W. Consequently, V({fi9j \ i = 1,... ,m, ; j = 1,..., q}) = V U W. ■ 

We now wish to consider a way of describing the points of an affine variety. 
We can accomplish this at times by parametrizing the variety. Parametric 
representations of curves and surfaces are used to draw them on a computer. 
The implicit representation of a variety is useful in determining whether or not 
a point lies on the curve or surface. 

Consider again the linear system of equations 

x + 2 y + z = 2 

x + y — z = 1. 

Then 

x = St 

y = 1-2 1 

z = t. 

t is called a parameter and Eqs. (27.5) is called a parametrization 
(27.4). 


(27.4) 

(27.5) 
of Eqs. 
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For another example, consider the equation x 2 + y 2 = 1. Then 

x = cos t 
y = sin t 

is another example of a parametrization. Another known parametrization of 
x 2 + y 2 = 1 (except for the point (— 1 , 0 )) is 

x = (1 - t 2 )/(l + t 2 ) 

V = 2t/(l + t 2 ). 

Note that —1 = (1 — 1 2 )/( 1 + t 2 ) is impossible, else —1 = 1 . Next, we show 
how to obtain this parametrization. 



Each nonvertical line through ( — 1,0) will intersect the circle in a unique 
point other than (—1,0). As t varies from —oo to oo, the corresponding point 
(x,y) traverses all of the circle except for the point (—1,0). The slope of each 
nonvertical line is given by 

{t - 0)/(0 - (- 1 )) = {y- t)/{x - 0 ). 


Thus, 

t = y/(x + 1) 
y == t(x + 1 ). 

Substituting t(x + 1) in for y in x 2 + y 2 — 1 yields 

x 2 + t 2 {x + l) 2 = 1 


(27.6) 


(1 + t 2 )x 2 + 2 t 2 x Ft 2 - 1 = 0. 
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Solving this latter equation for x in terms of £, we get the ^-coordinate of where 
the line intersects the circle x 2 + y 2 — 1. One solution is x = —1. Hence, x + 1 
divides (1 + t 2 )x 2 + 2 t 2 x + t 2 — 1. This division yields 

(1 4 - t 2 )x 2 + 2 t 2 x + t 2 — 1 = (x + 1)((1 + t 2 )x — (1 — t 2 )) = 0. 


Setting 

(1 + t 2 )x - (1 - t 2 ) = 0 , 

we obtain 

x = (1 — t 2 )/{l + t 2 ). 

Substituting this into Eq. (27.6) yields 

y = 2t/(l + t 2 ). 

Definition 27.1.8 Let V C K n be an affine variety. Let 

/(H) = {/ € K[xi,...,x n ] | /(ai,..., a n ) =0 for all (ai,...,a n ) € V}. 

Lemma 27.1.9 If V C K n is an affine variety, then I(V ) is an ideal of 
K[x 1 ,...,x n ]. 

Proof. Clearly the zero polynomial is a member of I(V) since 0(ai,..., a n ) = 

0 for all (ai,..., a n ) £ K n . Let f,g £ I (V). Then 

(/ + 0 )(ai, •• - ,On) = /(a i, • ■ • ,a n ) + $(ai, • • ■, a-n) = 0 + 0 = 0 . 

Thus, / + g £ I{V). Let h £ K[x i,..., rc n ]. Then 

(fif') (^i > • • • 5 Q'n) h{cL i,..., affi f ..., Q>n) — h(a\ ,..., <2^)0 — 0. 

Therefore, hf £ /(H). Hence, /(V) is an ideal of /f[xi,... ,i n ]. ■ 

The ideal /(H) in Lemma 27.1.9 is called the ideal of V. 

Example 27.1.10 Let V = {(0,0)} C K 2 . In this exam,pie, we show that 

I(V) = (x,y). 

Let f(x, y)x + g(x, y)y £ (x, y ). Then 

/( 0 , 0 ) 0 + s( 0 , 0)0 = 0 . 

Thus, f(x,y)x + g(x,y)y £ /(H). Hence, {x,y) C /(H).Let f(x,y) £ /(H). 

Then /(0,0) = 0. Now 

n T7i 

/o,^) = J2I2 a v x *y J 

j=0J==0 
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for some a*,- £ K. Therefore, 


n m 


0 — aoo + Q-ioO + Q'OiO + ^ yy a,ij 0 1 0“ ? . 

3=1 i =1 

This implies that a qq = 0 and so 


n m 


f(x, y ) = ai 0 x + ooi y + ^ Y1 dijx'y 3 £ (x, t/) 

J = 1 Z = 1 

Hence, I(V) C (x,y) . Consequently, I(V) = (x,y) . 


Example 27.1.11 LetV = K n . Then f £ I{K n ) if and only if f (a \,..., a n ) = 
0 for all (aj,..., a n ) £ K n . Hence, if K is infinite, then f is the 0 polynomial. 
Thus, I(K n ) ~ {0} if K is infinite. 

Lemma 27.1.12 Let f\,.. ., f s £ K[x\, ..., x n }. Then 

(h,...J s )CI(V(f u ...J s )). 


Proof. Let / £ (/i,..., f s ) ■ Then / — h\fi H-h h s f s for some h\,...,h s £ 

K[x i,...,x n ]. Thus, for all (ai,...,a n ) £ V(fi,..., f s ), 

f (®11 • • • j &n) " h\ (di, . . . , djfi f\ (di, . . . , djfi + h 

hs{d 1, • • • , 0-77) js ( a l i • ■ ■ 1 fl 7l) 

= 0 + --- + 0 
- 0 

and so / £ I(V(f u f s ))- Hence, (/i,..., / s > C I(V{f u ..., /,)). ■ 

Proposition 27.1.13 Let V and W be affine varieties in K n . Then 

(i) VCW if and only if I(V) D I(W), 

(ii) V = W if and only if I(V) = I(W). 

Proof. (i) Suppose that VC. W. Let / £ I(W). Then for all (ai,..., 

On) € W, /(ai,..., a n ) = 0. Hence, for all (ai,...,a n ) £ V, /(ai,...,a n ) = 0 
and so / £ /(H). Thus, I(W) C /(H). Suppose that I(W) C /(H). Since IH is 
an affine variety, there exist gi, ... ,gt £ K[x i,..., x n \ such that W — V(gi ,..., 

9 t)- Therefore, gi,...,g t G I(W) C /(H). Consequently, ^i(ai,..., a n ) = ■ • • = 
gt(a\, ... ,a n ) = 0 for all (ai,...,a n ) £ H. Thus, (ai,...,a n ) £ W for all 
(ai,..., a n ) £ H and so H C W. 

(ii) Clearly, if H = W, then /(H) — /(W). Suppose that /(H) = I(W). 

Then by (i), H D W and H C W. Hence, H = W. ■ 

We have seen in this section how the generators of an ideal in a polynomial 
ring can be associated with a system of polynomial equations. An ideal may 
have different sets of generators. See for example Exercises 3 and 4. In the 
next section, we will be interested in determining the “best” generating set. 
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27.1.1 Worked-Out Exercises 

0 Exercise 1 Let V = V(y — x) C R 2 . Show that 

I{V) - {y~ x ) ■ 

Solution: Clearly y — x E I{V). Hence, (y — x) C I(V). Let x Q y@ be a 
monomial in R[:c,y]. Then a and f3 are nonnegative integers. By the binomial 
theorem 

x a yP = x a (x+(y — x))P 

= ^{x 13 + (E?=i - :r ) z_1 )( V ~ x )) 

= h a p{y ~x)+ x a+ P 

for some h a/ 3 E R[x, y\. Thus, for all / E R[x, y], 

/ = E™=o E/3=o r a px a yP (r a p E R) 

= Eo=0 E?=o r a p{h a p{y -x)+ x a+(3 ) 

= h(y — x) +r 

for some h E R[x, y], f E R[a:]. 

Let / E IiV). Then / = h,(y — x) + r as above. Since / vanishes on V, we 
obtain 

0 = = 0 + r(t) 

for any real number t. Therefore, r — 0 and so / = h(y — x) E {y — x) . Hence, 
I(V) Q (y — x) . Consequently, 


I(V) = (y-x). 

(We see that / E (y — x) if and only if f(t, t ) = 0.) 

27.1.2 Exercises 

1. Let V = V(y — x, z — x 2 , w — j: 3 ) C R 4 . Show that 

I{V) — (y — x, z — x 2 , w — x^ . 

2. Show by an example that equality need not hold in Lemma 27.1.12. 

3. In the polynomial ring K[x,y\, show that 

(x, y) = (x + y, x - y) = (x + xy, y + x y, x + x 2 y 2 , y - zr 2 y 2 ) . 

4. In the polynomial ring Q[x], find a single generator for the ideal 

(x A + 3x 3 + 2x 2 , x 2 + 2x + l\ . 
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5. Show that 

V(Zx 2 + 2y 2 — 11,x 2 — J/ 2 + 3) = {(1,2), (-1,2), (1,-2), (-1,-2)} 
by showing 

(3x 2 + 2 y 2 - 11, x 2 - y 2 + 3^ = (x 2 - 1, y 2 - 4^ . 


27.2 Grobner Bases 

The concept of Grobner bases provides computational means for solving prob¬ 
lems in mathematics, computer science, engineering, and science. The real im¬ 
pact of Grobner bases is that they can be computed. Influenced by Wolfgang 
Grobner, Bruno Buchberger introduced Grobner bases in 1965. His algorithm 
for computing such bases is the major contribution to the theory. We will 
not explicitly give his algorithm here. The interested reader may pursue the 
subject further in Adams and Loustaunau or Becker and Weispfenning or Cox, 
Little and O’Shea. 

In the following, we let W denote the set of whole numbers, i.e., the non¬ 
negative integers. 

Definition 27.2.1 Let y be a relation on 

W n = {(<*!,..., a n ) | = 

and > be the relation on the set of monomials of K[x i,..., x n \ defined by x a 
> X' 3 if and only if a 7- (3. If >- satisfi.es properties (i), (ii), and (Hi), then > 
is called a monomial ordering, where 

(i) y- is a total (linear) ordering on W n ; 

(ii) for all a, (3, 7 6 W n , a 7 ft implies a + 7 7- (3 + 7; 

(Hi) every nonempty subset of^W n has a smallest elem.ent relative to 7 . 

We ask the reader to verify that a relation 7 on W n satisfies (iii) of Defini¬ 
tion 27.2.1 if and only if every strictly decreasing sequence in W n terminates. 

Definition 27.2.2 Define the relation 7/ on W n by for all a, (3 £ W n , a >-1 (3 
if the left,-m.ost nonzero entry in a — (3 € Z n is positive. Define the relation >/ 
on the set of m.onomials of K[x\,... ,x n ] by x a >1 x@ if and only if a 7/ (3. 
Then 7/ and >1 are called lexicographic (or lex) orders. 

Definition 27.2.3 Define the relation >- gr i on W n by for all a, (3 E W n , 
& grl P if 

|o| > \P\ , or \a\ = \P\ and a 7/ ft. 

Define the relation > gr i on the set of monomials of K[x 1 ,..., x n ] by x a > gr i x@ 
if and only if a 7 ffr / fi. Then A gr [ and > gr i are called graded lexicographic 
orders. 
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Definition 27.2.4 Define the relation y gre i on W n by for all a,(3 G W 71 , 
a >~ gTe i (3 if either |a| > \(3\ or |a| = \(3\ and the right-rrwst nonzero entry in 
a — j3 G Z n is negative. Define the relation > gre i on the set of monomials of 
K[x i,. ..,x n ] by x a > grei x& if and only if a y gre i (3. Then y gret and > gre i are 
called graded reverse lexicographic orders. 


(1,2,3) H (1,1,4) since (1,2,3) - (1,1,4) - (0,1, —1) and 1 > 0. 


(1,2,3) y^ (1,1,4) since |(1,2,3)| = 6 = |(1,1,4)| 
and (1,2,3) y, (1,1,4). 

(1,2,3) ygrei (1,1,4) since |(1,2,3)| = 6 = |(1,1,4)|, but 

(1,2,3) - (1,1,4) = (0,1,-1) and - 1 < 0. 


(1,2,3) ygri (1,3,1) and (1,2,3) y gre i (1,3,1) since 
1(1,2,3)| = 6 > 5 = |(1,3,1)|. 

We ask the reader verify the following result. 

Theorem 27.2.5 The lex ordering on W n is a mnnomial ordering. ■ 

Definition 27.2.6 Let f = a a :r a £ X[xi,...,x n ], f 0 and > be a 
m,onom,ial ordering. 

(i) The multidegree of f, written multideg(f), is defined to be 

max{a ew"|a a /0}, 

where the maximum is taken with respect to >; 

(ii) the leading coefficient of f, written LC(f), is defined to be a^ G K , 
where p = multideg{f)\ 

(Hi) the leading monomial of f, written Mf(/), is defined to be 

where p = multideg (/); 

(iv) the leading term of /, written LT(f ), is defined to be LC(f)-LM(f). 

Example 27.2.7 Let f = 2 x^z 2 — 3 x 4 y 2 z 2 + bxy 3 + 5 y^z 2 with x > y > z. 
Then, with respect to lexicographic ordering we have 
m.ultideg(f) = (4,2, 2) 

LC(f) = -3 
LM(f) = x A y 2 z 2 
LT(f) = -3 x 4 y 2 z 2 . 
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The following theorem is a generalization of Theorem 14.1.4. 


Theorem 27.2.8 Let f,g E K[x \,..., x n \ be nonzero. Then the following 
properties hold. 

(i) m,ultideg(fg) = multideg(f)+m,ultideg(g ); 

(ii) If f + g 7 ^ 0, then multideg(f + g) < imLx{multideg(f),m,ultideg(g)} with 
equality holding when m.ultideg(f ) ^m,ultideg(g). ■ 


We now consider the division algorithm for polynomials of several variables. 
The idea is to divide a polynomial / in K[x\,..., x n ] by polynomials fa,..., ft € 
K[x\, ..., x n ] to obtain quotients q\, .. ., qt and a remainder r £ K[x i,. .., x n ] 
such that / = qifi + • • • -f q t f t 4 - r. We illustrate the procedure with some 
examples before we state the actual division algorithm. 


Example 27.2.9 Let f = x 2 y 2 + y 4- 1, /i = xy 4 1, and fa = y 4- 1. We use 
/ex order with x > y. Since LT(fa) > LT(fa ), u;e list fa first in the following 
scheme: 


xy 4 1 
V + 1 


91 : 

92 : _ 

x 2 y 2 + y + 1 
x 2 ?/ 2 4- xy. 


The leading term,s, LT(fa) = xy and LT(fa) = y, both divide LT(f). Hence, 


divide f by fa first. We obtain 



9i : zy 


92 : 

xy 4 1 

:r 2 y 2 + y 4 l 

y + 1 

9 9 

x y + xy 

-xy + y 4 1. 


Both LT(fa) a.nd LT(fa) divide —xy. Hence, we divide — xy + y 4 1 by LT(fa). 
This tim.e we obtain 


gi : xy - 1 
92 : 

—■n—n- 


xy 4 1 

x 2 y 2 4 y 4 1 

y + 1 

x 2 y 2 4 xy 


-xy 4 y 4 1 
-xy - 1 


y + 2. 


77ws //me LT(fdoes not divide y, but LT(fa) does. Hence, we divide v 4 - 2 
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by LT(^f 2 } to obtain 


91 : xy ~ 1 

92 : 1 

—n—rv- 


xy +1 

x 2 y 2 + y + 1 

y + 1 

x 2 y 2 + xy 


-xy + y + 1 


-xy - 1 


y + 2 


y + 1 


1. 


Thus, we have that x 2 y 2 + y -f- 1 = (xy — 1 )(xy + 1) + l(y + 1) +1. 

In the following example, we illustrate a slight complication of this proce¬ 
dure. 

Example 27.2.10 Let f = x 2 y + xy + y, fi — x 2 + y, and f 2 — y 2 + 1. We use 
lex order with x > y. Since LT(f\) > LT(f 2 ), we list f\ first in the following 
schem.e: 


91 ■ V 

92 : 

nr'^'o1 _L 


x 2 + y 

x 2 y + xy + y 

y 2 + 1 

x 2 y + y 2 

xy - 1 / + y. 


Now neither LT(fi) nor LT(f 2 ) divides xy. Hence, we pull xy out as a remain¬ 
der. We thus arrive at 



9i : y 


92 : 

x 2 + y 

x 2 y + xy + y 

j/ 2 + i 

x 2 y + y 2 


~V r +y~ r : xy. 

Now LT(fi) does not divide —y 2 , but LT{f 2 ) does. Hence, 



91 • y 


92 : “1 

x 2 + 7 / | 

x 2 y -4- XT/ -h 2/ 

y 2 + l 1 

x 2 y + y 2 


-y r + y 

-y 2 - 1 


—U + 1 

rit^LC 
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Now neither LT(/i) nor LT(/ 2 ) divides y. Hence, we pull out y 4- 1 as a re¬ 
mainder to obtain 



91 : y 


Q2 : -1 

x 2 + y 

x 2 y + xy + y 

y 2 + 1 

x 2 y + y 2 


-y l + y 
-y 2 - 1 


r : xy + y + 1. 

Hence, x 2 y 4- xy 4- y = y{x 2 By) — 1 (y 2 4- 1) 4- xy 4- y 4- 1. 

Theorem 27.2.11 (Division Algorithm) Lei > be a m,onom.ial ordering on 
the set of m.onomia,ls of K[x \,..., x n }. Let /, fi,ft E K[x i,..., x n ], where 

LT(fi) > LT(h) >■■■> LT(f t ). 

Then there exist q\,... , qt, r E A[:ei,..., x n ] such that f = q\f\ + • - • AqtftEr, 
where either r = 0 or r is a K-linear com.bina.tion of m.onom.ials, none of which 
is divisible by any of LT(/i),..., LT(ft). Also, multideg(f) > m,ultideg(qifi) for 
those qifi ^ 0. ■ 

We ask the reader to show in Exercises 3 and 4 that if the order of f \,..., ft 
by which we divide / is altered, then the remainder r may also be altered. 

Definition 27.2.12 An ideal I C K[x i,..., x n ] is called a monomial ideal 
if there exists A C W n such that I = {{x a | ol £ A}) . 

If / = {{x“ | a € A}) is a monomial ideal, then every element of I is a finite 
sum of the form Ec*eA h a x a , where h a E K[x i,..., x n \. 

Lemma 27.2.13 Let I = {{x a | a E A}) be a monomial ideal. Then a, m.ono- 
mial x@ E I if a.nd only if x a \ x@ for som.e a E A. 

Proof. If x a | x@, then there exists h a E K[x \,..., x n \ such that x@ — 
h a x Q E I. Suppose x@ E I. Then x@ = E;=i hiX a( ^\ where hi E K\x i,... ,x n \ 
and a{i) E A. Now hi — hj k tJ E K for all i,j. Thus, x& — E;=i 

(EfJi kij xPM)x a ^ and so x? = Zl=i E^i hj x^ +a ^. Hence, (5 = /3(ij) 
4- j = 1,... ,m;; i = 1,,77, except for those i and j which drop out 

when like terms are combined. Now x a ^ j x^^ +a ^\ j = 1, ... , 777 ,; ; i = 1, 
..., n. Thus, | , i = 1,..., tt.. ■ 

a + W n = {a ±j | 7 E W 71 } 

“ ku ^ hc . Maf4a##ia£lea/ 
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consists of the exponents of all monomials divisible by x a . If 

i = (*v,*V,*V), 

then the exponents of the monomials in I form the set 

((4, 2) + W n ) U ((3,4) + W n ) U ((2,5) + W n ). 

Lemma 27.2.14 Let I be a monomial ideal and f £ K[x i,... ,x n ). Then the 
following are equivalent. 

(i) f el. 

(ii) Every term, of f lies in I. 

(in) f = a a x a for som,e a a £ K and x a £ /, 0 < a < t. 

Proof. That (iii) => (ii) => (i) is immediate. 

(i) (iii): Since f £ /, / = x a ^ f° r some h n £ K[xi,...,x n ] 

and a(i) £ A, where I — ({x a | a 6 A}) . Now hi = YfJjLi &ij i = 

1,..., g, kij £ K for all i,j. Thus, 

q vra 

f = EE fc^x^V® (27.7) 

7=1 j = 1 

Therefore, (iii) holds since x^ z ^x a ^ is a monomial in I. ■ 

Corollary 27.2.15 Let I and J be m.onom.ial ideals. Then I = J if and only 
if I and J have the sam.e m.onom,ials. 


Proof. Let I = ({ x a \ a £ A}) and J = {{x^ | (3 £ B}^ . Suppose / and J 

have the same monomials. Thenrr^ £ ({rr a | a £ A}) and x a £ x@ | (5 £ B }^> 
Hence, 

<{*“ | a € A}) = ({x' 3 |/5 6 B}). 

The converse is immediate. ■ 


Theorem 27.2.16 (Dickson’s Lemma) Let I = ({a: a | a £ A}) C K[x i, 

..., x n ] be a m,onom.ial ideal. Then there exist o(l),..., o(s) 6 A such that 
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Proof. If n = 1, then I = {{z“ | a G A}), where a G A C W. Let (5 be 
the smallest element of A. Then x^\x® for all a G A and so I = (^ x i^ • Now 
assume that n > 1 and that the theorem is true for K[x i, ... ,x n -i\. Consider 
K[x i,..., x n -\,y\ so that the monomials in K[x i,..., z n _i ,y\ can be written 
x^y 771 , where a — (oi,..., o n -i) € W n_1 and m G W. 

Suppose that I C 77[zi,..., z n _i,7/] is a monomial ideal. Let 

J = ({z a | x a y m G I for some m}) 

in K[x i,..., x n _i]. Then by the induction hypothesis, there exist z 0 ^ 1 ),..., z Q ( s ) 
(x a ^ G {z a | z a ?/ m G 7 for some m}) such that J = (x a ( l \ ..., z a ( s ^ in 
7f[zi,..., z n _i]. By the definition of J, x a ^y Tni G / for some > 0 ,i = 
1,..., s. Let m — max{mi,... ,m s }. Let = <^{z^ | x@y k G 7}^} in K[x\ ,..., 
z n _i], fc = 0,... ,m — 1. By the induction hypothesis, there exist x ak ^\ x ak ^ 2 \ 
..., such that , z afe ( Sfe )y» in K[xi ,..., z n _i], where z afc( ^ 

G {z^ | x@y k G 7}. 

We now show that 7 is generated from the following list of monomials 

z a ( 1 )i/ m ,..., x a ^y m from J 
z ao ^\ ..., z a °( s °) from Jo 
x ai ^y ,..., z Ql ( Sl ^ from Ji 

x O! m _i(l)^Tn-l^ ^ from J m _l- 


Let z Q y p G 7. If p > m, then x a y p is divisible by some x a ^y m by the 
construction of J and Lemma 27.2.13. If p < m — 1, then x oc y p is divisible by 
some x a P^y p by the construction of J p and Lemma 27.2.13. It follows from 
Lemma 27.2.13 that the above monomials generate an ideal having the same 
monomials as 7. By Corollary 27.2.15, these ideals are the same. 

Now x ak ^y k G 7, but we don’t know x ak ^y k G {z a | a G A} C 7f[zi, 
..., z„], where x n = y. We write 7 = (x^^ l \ . .., z^( s ^ , where z(= 7, z = 

1, ..., s. By Lemma 27.2.13, each z^ is divisible by some z a ^ G A and 7 
= ({z a | a G A}) . Thus, 

ID (x^ l \...,x^) D (xW\...,xM s ^ =1. 


Hence, 7 = ^°( 1 ),...,z Q ( s )).B 


The ideal 7 in the following example already has a finite basis. However, it 
is merely our intention to illustrate the proof of Dickson’s lemma. 
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Example 27.2.17 Suppose that 

T / 5 2 23 2 5\ 

I = {x y ,x y ,x y J . 

Then 

J — x 2 , x 2 ^ = ( ;r2 ) ? ' n ^IXI- 

Clearly m = 5. iVoic 

Jo = | x ' 3 £ /}) = W = { 0 } 

J! = /{x * 5 | xPy € /}J> = ( 0 ) = { 0 } 

J 2 = ({a^ I x@y 2 £ J}\ = (X 5 ) 

J 3 = ({a ;^ 3 | x^y 3 £ I}\ = (x 2 ) 

J\ =\[x& | x@y^ £ /}y = { x 2 ) . 

Hence, 

I = (x 2 y 5 ,x 5 y 2 ,x 5 y 3 ,x 2 y 4 ^ . 

Corollary 27.2.18 Let > be a relation on W 71 satisfying the following prop¬ 
erties: 

(i) > is a total ordering on W n ; 

(ii) if a > (3 and 7 E W n , then a + 7 > (3 + 7 . 

Then > is a well-ordering if and only if a > 0 for all a E W n . 

Proof. Assume > is a well-ordering. Let oq be the smallest element of W n . 
Suppose that 0 > ao- Adding nao to both sides we obtain nao > (n 4 - l)ao, 
where n is a positive integer. Hence, 

0 > <ao > 2 «o > ■ • ■ > nao > (n + l)ao > ■ ■ • 

is an infinite descending sequence, a contradiction. 

Conversely, suppose that a > 0 for all a E W n . Let A C W n be nonempty. 
Since I = ({x a | a E A}) is a monomial ideal, we have by Dickson’s lemma 
that there exists a(l), ..., a(s) E A such that I — ^a; 0 ^ 1 ),..., x a ( s ^ . There is 
no loss in generality in assuming a(l) < a(2) < • • • < a(s). Let a £ A. Then 
x a £ I and so x a ^ divides x a for some a{i). Thus, a = a(i) + 7 for some 7 E 
W n . Therefore, 7 > 0 and so by (ii), a = a(i) + 7 > a(i ) + 0 = a{i) > a(l). 
Consequently, a(l) is the smallest element of A. ■ 

Hence, in Definition 27.2.1, (iii) can be replaced by the simpler condition 
that a > 0 for all a £ W n . 

~Pu+lc- 



27.2. GROBNER BASES 


591 


Definition 27.2.19 Let I be an ideal of K[x\,... ,x n ], I ^ (0) . Let 

LT(I ) = { cx a ] there exists f £ I with LT(f ) = ex 0 }. 

Example 27.2.20 Let y > x. Let I — (/i,/ 2 ) , where f\ = y 3 — yf(x ) and. 
f2 = y 2 x — x f( x ) + Vi where f £ K[x,y] is a nonzero polynomial in x alone. 
Use the grl ordering on monomials in K[x,y]. Then y 2 = y(y 2 x — xf(x) + 

y) — x(y 3 — yf{x)) £ I. Thus, y 2 = LT(y 2 ) £ { LT(I )) . By Lemma 27.2.13, 

y 2 £ (LT(fi), LT(f 2 )) since LT(fi) does not divide y 2 and LT{f 2 ) does not 
divide y 2 , using grl ordering. Hence, {LT(f\),LT(f 2 )) C { LT(I)) . 

Theorem 27.2.21 Let I be an ideal of K[x i,... ,x n ]. Then 

(i) (LT{I)) is a m.onomia.l ideal, 

(ii) there exist g \,... ,gt £ I such that 

(LT(I)) = (LT( 9l ),...,LT(g t )). 

Proof. (i) Let g £ /\{0}. Recall that LM(g) = x m , where m, — multideg(g) 
= max{ a £ W n | a a ^ 0}, where g = ]T] a a Q x Q . Recall also that LT(<?) 
= a a LM(p), where a is the multidegree of g. Since LM(p) and LT(p) differ 
by a nonzero constant, ({LM(p) | g £ J\{0}}) — (LT(J)) . Thus, (LT(J)) is a 
monomial ideal. 

(ii) Since (LT(/)) is generated by the monomials LM(c/) for g £ /\{0}, 
(LT(/)) = (LM(pi),..., LM(pt)) for finitely many g\,... ,gt £ I by Dickson’s 
lemma. Since LM(^) differs from LT(pj) by a nonzero constant, it follows that 

(LT(/)) = (LT( 9l ),...,LT( St )).B 

In the following result, we have selected one particular monomial order to 
use the division algorithm and to compute leading terms. 

Theorem 27.2.22 (Hilbert Basis Theorem) Let I be an ideal of K[x i,..., 
Xrf\. Then there exist gi,... ,gt Si such that I = (<7i,... ,gt) ■ Thai is, I has a 
finite generating set. 


Proof. If / = {0}, then I = (0) . Suppose that I contains some nonzero 
polynomial. By Theorem 27.2.21, there exist gi, ..., g t £ I such that (LT(/)) = 
(LT(pi), ... ,LT(p t )) . Clearly (gi,...,g t ) Q I. Let f £ I. Divide / by gi,... ,g t 
to get an expression of the form 


/ = aigi H-1- a t g t + r, 

where every term in r is divisible by none of LT(^i),... ,LT (gt). Now 


— / ~ ( a i9i + - • + a t gA £ I. 
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If r 7^ 0, then LT(r) E (LT(/)) = (LT(pi),... , LT(<?t)) and so by Lemma 
27.2.13, LT(r) is divisible by some LT(^). This contradicts what it means to 
be a remainder. Hence, r = 0. Thus, 


/ — a i9i + • • • + &t9t £ {9i > • • • > 9t) - 
Therefore, / C (g u ... , g t ) . Consequently, I = (gi, ..., gt) ■ ■ 

In addition to answering the ideal description question, the basis {<71, ..., gfi} 
used in the proof of Theorem 27.2.22 has the special property that 

(LT(/)) = (LTb 1 ),...,LTto)). 

Definition 27.2.23 Fix a m.onomial ordering. A finite subset G = { 51 ,... ,gt] 
of an ideal I is said to be a Grobner basis (or standard basis) if 

(LT( dl ),...,LT(g t )) = (LT(I)). 

Corollary 27.2.24 Fix a, m.onomial order. Let I be a nonzero ideal of K[x \, £ 2 , 
..., x n \. Then I has a Grobner basis. Furthermore, any Grobner basis for I is 
a basis for I. 


Proof. The set G = {<?i,..., gt} constructed in the proof of Theorem 27.2.22 
is a Grobner basis by definition. Note also that if 

(LT(/)) = (LT(<7 i),..., LT(pi)), 

then the argument in Theorem 27.2.22 shows that I — (g\, ..., gt) so that G is 
a basis for I. M 


Example 27.2.25 Let y > x. Consider I in Example 27.2.20. { y 3 — yf(x ), 
y 2 x — xf(x) + y} is a basis for /, but not a Grobner basis since y 2 E ( LT(I )) , 
but y 2 £ (LT(y 3 - yf(x)), LT{y 2 x - xf(x) + y)) . 


Example 27.2.26 Let x > y > z. Consider the ideal J — ( x — z l ,y — z ■ 7 ) 
in R [x,y,z\, where i and j are fixed positive integers. To show that {x — z 1 , 
y—z 3 } is a Grobner basis for J, it suffices to show th.ai the leading term of every 
nonzero element in J lies in (LT(x — z l ), LT(y — z 3 )) = (x,y). By Lemma 
27.2.13, this is equivalent to showing that the leading term of any nonzero 
element of J is divisible by either x or y. Consider any f = Ag\ + Bg 2 E J, 
where g\{x,y,z) — x — z 1 and g 2 (x,y,z) = y — z 3 . Suppose that / 7^ 0 and x 
and y do not divide LT(f). Then by the definition of lex order, f must be a 
polynomial in z alone. However, f vanishes onV = V{x — z l ,y—z 3 ) C R 3 since 
f E J. Clearly, (f 1 , t 3 , t) E V for any real num,ber t. The only polynomial in z 
alone that vanishes a,t all these points is the zero polynomial, a contradiction, 

since f A 0. Hence, { 01 , 02 } a Grobner basis for J. 

“Pufix*. 7^af4c#fia£uiaZ 
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Theorem 27.2.27 Let I be an ideal of K[x i,...,x n ], G — {gi,---,gt} be a 
Grobner basis for /, and f E K[x i,.. . , x„]. Then there exists a unique r E 
K\x i,. .., x n ] such that 

(i) no term, of r is divisible by one of LT(gi ),... ,LT(g t ) ) 

(ii) there exists g E I such that f — g + r. 

Proof. The division algorithm gives f = a\g\ + • • • + a t gt + r, where r 
satisfies (i). g satisfies (ii) by letting g = aigi + • • • + atgt- 

To prove uniqueness, suppose that f = g\ + T\ = 92 E r 2 with (i) and (ii) 
holding. Then r 2 — rq = g\ — g 2 E I. Thus, if r 2 — r\ ^ 0, then LT(r2 — ri) E 
LT(7) = (LT(^i),... , LT(c^)) . By Lemma 27.2.13, it follows that LT(r2 — rq) is 
divisible by some LT(^). This is impossible since no term of r i, r 2 is divisible 
by one of LT(^j),... ,LT(gt). Therefore, rq — r2 = 0, i.e., rq = r2- (Every term 
of 7*2 — rq is a term of either ri or r2, except for a constant multiple.) Hence, 
9\ = 92 also. ■ 

By Theorem 27.2.27, we can list the elements of G in any order when 
dividing f by G since the remainder r is unique. 

Corollary 27.2.28 Let I be an ideal of K[x \,..., x n } and G = {g i,..., gt} be 
a Grobner basis for I. Let f E K[x i,... ,x n ]. Then f E I if and only if the 
remainder on division of f by G is zero. 

Proof. If r = 0, then / = a\g\ + • ■ ■ + a t g t E I. Conversely, suppose that 
/ E I - Then / = / + 0 satisfies (i) and (ii) of Theorem 27.2.27. Thus, r = 0 
by the uniqueness of r. ■ 

Definition 27.2.29 Let f F denote the remainder on division of f by the or¬ 
dered s-tuple F = (/i,.. -, / s ). 


If F in Definition 27.2.29 is a Grobner basis for {/i, - -., / s ), then we can 
regard F as a set (without any particular order) by Theorem 27.2.27. 

Example 27.2.30 Let 


F = ( x 2 y - xy , x*y 3 - xy) 


and 

f(x,y) = x 5 y 2 . 

Use lex ordering with x > y to obtain 



-xy 2 . 
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qi■ 0 

q 2 : x 3 y + x 2 y + xy + y 

4 

x y — xy 

x 3 y 2 

0 

x y — xy 

x 5 y 2 — x 4 y 2 


A 2 

xy 

4 2 3 2 

xy — xy 


x 6 y 2 

x 3 y 2 — x 2 y 2 


x y 

2 9 9 

xy — xy 


- T 

xy z . 


Example 27.2.31 Let y > x. Consider the ideal I of Example 27.2.20. Then 
I = (fuh) > where fi = y 3 - yf(x ) and f 2 = y 2 x - xf(x) + y. Use the grl 
ordering. Now y{y 2 x — xf{x) + y) — x(y 3 — yf(x)) = y 1 . Thus, y z € I. Now 
LT(h)Xy 2 and LT(h)fy 2 . Hence, 

LT(y 2 ) =y 2 $ (LT{h),LT(h)). 

We see that xLT(fi) — yLT{f 2 ) = 0. That is, the leading terms in xfi — yf <2 
cancel, leaning only sm.aller terms. is n ot a Grobner basis because 

(LT(I)) £ (LT(h),LT(h)). 

Note: LT(y 2 ) € LT(I) and LT(y 2 ) £ (LT(h), LT(h)) ■ 

Definition 27.2.32 Let /, g be nonzero polynomials in K[x \,... ,x n \. 

(i) Let multideg(f) = a, m,ultideg(g) = (5, a,nd 7 i = majc{aj, /3^}, i — 
1 , 2 , ..., n. Let 7 = (71 , • ■ •, 7n)- Then x 1 is called the least common multiple 
of LM(f) a,nd LM(g), written 

L = x 1 = LCM(LM(f), LM(g)). 

(ii) The polynomial 

S ^f' ^ = LT(f) ‘ f ~ LT{g) ’ 9 
is called the S -polynomial of f and g. 


Example 27.2.33 Let y > x. In Example 27.2.31, let f(x) = x so that fi = 
y 3 —yx and /2 = y 2 x—x 2 + y in R[x, y] with the grl order. Then m,ultideg(fi) = 
(3,0) and m.ultideg(f 2 ) = (2,1). Thus, L = y 3 x and 


_ y°x r y°x / 

— y s Ji y 2 x h 

- x fi - yh 


9 9 9 

— — yx + yx — y 


= -y ■ 

Pufuc. 7feoiAe»tatLca/ PlLy-SALcS. 
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The purpose of an 5-polynomial S(f, g ) is to produce cancellation of leading 
terms. The next lemma shows that 5-polynomials are involved in every such 
cancellation. 


Lemma 27.2.34 Consider the sum. CiX a ^g 2 , where ci,.. 

stants and 

a(i) -I- m,ultideg(gi) — 8 £ W n 


whenever Ci ^ 0. If 


m.ultideg(^2, c iX a ^gi) < < 5 , 


l 


then there are constants Cjk such that 


ct ore con- 


E 


CiX a ^gi = J2 c J kx6 ljk S(9j,9k), 

j,k 


(27.8) 


where x 1 i k = LCM(LM(gj), LM(gi c )). Furth.erm.ore, each x 6 1 i k S(gj,gk ) has 
m,ultidegree < 6. ■ 


In Eq. (27.8), every summand CiX a ^gi on the left has multidegree 6. Thus, 
the cancellation occurs after the summands have been added. Each summand 
on the right has multidegree < 6. Hence, the cancellation has already occurred. 
We see that the 5-polynomials account for the cancellation. 


Theorem 27.2.35 Let I he an ideal in K[x \,..., x n \. Then a basis G = 
{9ii ■ ■ •: \9t } for I is a Grobner basis for I if and only if the remainder, S{gi,gj ) G , 
on division of S(gi,gj ) by G (listed in som,e order) is zero for all i,j with 

3- ■ 

Example 27.2.36 Consider the ideal 

1= (y- X 2 ,z- X ^' 


We show with the help of Theorem, 27.2.35 that G = {y — x 2 ,z — x 3 } is a 
Grobner basis for lex order with y > z > x. Consider S(y — x 2 ,z — x 3 ). Now 


S(y — x 2 , z — x 3 ) = ~(y — x 2 ) — —{z — x 3 ) = — zx 2 + yx 3 . 

V z 


Qi ■ x'- 


q 2 : -x z 


2 

y-x 

'J ' T 

yx — zx* 



z — X 3 

yx — x 




—zx 2 

+ 

X 5 


2 

— zx* 

+ 

X 5 


~Pu*lXZ- Tfai 
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Thus, 

—zx 2 + yx 3 = x 3 {y — x 2 ) + {—x 2 )(z — x 2 ) -f 0 . 

Hence, 

S{y — x 2 , z — x 3 ) G = 0 . 

Therefore, G is a Grobner basis for I. 

Now consider G with lex order x > y > z. Then g\{x,y,z) = —x 2 + y and 
g 2 (x,y,z) = —x 3 + z. Now multideg{g\) = (2,0,0) and m,ultideg(g 2 ) = (3,0,0). 
Thus, 7 = (3,0,0). Hence, 

S(-x 2 + y, -x 3 + z) = ^(-x 2 + j/)-^(-x 3 + z) 

= x 3 — xy — x 3 z 
= — xy + z. 

qi : 0 
92 : 0 
—xy + z 

0 _ 

—xy + 2 

—xy + z — 0 (— x 3 + z) + 0 (— x 2 + y) + {—xy) z = —xy + z ^ 0 . 

Therefore, {—x 2 4 - y, —x 3 + z} is not a Grobner basis with lex order x > y > 
z. 


-x 3 + 2 
— x 2 + y 


Every ideal in K[x i,..., x n ] has a Grobner basis, but the proof of this result 
was nonconstructive. We will now show how to construct a Grobner basis. 


Example 27.2.37 Let y > x. Consider K[x,y\ with grl order and let I = 
(/ij/ 2 ) i where fi = y 3 — yx and /2 = y 2 x — x 2 4 - y. By Example 27.2.31, 


{/i,/ 2 } is not a Grobn 
By Example 27.2.33, 


Now 


basis since LT(S{fi, / 2 )) = 

s(/i,/ 2 ) = V- 



qi ■ 0 


92 : 0 

y 3 - yx 

- y 2 

y 2 x — x 2 + y 

0 


V 2 t (LTifO, LT(f 2 )). 


The remainder of S{fi, / 2 ) upon division of f\, f^ is not zero. Hence, we should 
include the remainder in our generating set. Let F = {/i,/ 2 , fz}, where fa = 

Vitti-CL T^oiAoftai^Lca/ 
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-y 2 . Then S(fa,fa) = h- Consequently, S(fa,fa) F = 0. Consider fa and fa. 
Then multideg(fi) — (3,0 )and m,ultideg(f 3 ) — (2,0). Therefore, L = 1 / 3 . Thus, 

S(fi,h) = (y 3 /y 3 )(y 3 -yx) - (y 3 /(-y 2 ))(-y 2 ) = -yx. 


but 

S(fa,fa) F = -2/x 7 ^ 0. 

Therefore, we must add fa = —yx to our generating set. Let F = {fa, fa, fa, 
fa}. Then 

5(/i,/ 2 ) f = S(/ 1: / 3 ) F = 0. 

Consider fa and fa. Then multideg(fa) = (3,0) and multideg(fa) = (1,1). 
Hence, L = y 3 x. Thus, 

S{fi,fa) = x(y 3 - yx) - (-y 2 )(-yx) - -yx 2 = xfa 

and so S(fa,fa) F = 0. Consider f 2 and fa. Then multideg(fa) = (2,1) and 
multideg (fa) = (2,0). Thus, L — y 2 x. Hence, 

S(f 2 , fa) - (y 2 x - x 2 + y) - (- x)(-y 2 ) = -x 2 + y, 


but 

S(h,f 3 ) F = -x 2 +y^0. 

Thus, we must also add fa = —x 2 + y to our generating set. Let F — 
{fa, $2, fa, fA, fa}- Then one can show that S(fa,fj) F = 0 for all 1 < i < 
j < 5. Hence, by Theorem. 27.2.35, {fa, fa, fa, fa, fa} is a Grobner basis for I . 

Lemma 27.2.38 Let I be an ideal in K[x 1 ,..., x n ]. Let G be a Grobner basis 
for L Let g E G be a polynomial such that LT(g) E (LT(G\{g})). Then G\{g} 
is also a Grobner basis for I. 

Proof. We have that (LT( 6 r )) = (LT(/)) . Suppose that 

LT(s) € <LT(G\{g})). 

Then (LT(G\{£f})> = <LT(G)) . Consequently, (LT(G\{g})> = <LT(/)) and so 
G\{g} is a Grobner basis for I. ■ 

Definition 27.2.39 Let I be an ideal in K[x\, ..., x n ] and G be a Grobner 
basis for I. Then G is a minimal Grobner basis for I if the following con¬ 
ditions hold: 

(i) LC(g) — 1 for all g E G and 

(n) for all geG, LT(g) (LT(G\{g })>. 

“Piifix*. 7feaiAe»tafLca/ 
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Example 27.2.40 Let, y > x. R is possible to construct a minimal Grobner 
basis for a nonzero ideal by applying the above procedure a.nd then using Lemma. 
27.2.38 to eliminate any unneeded generators. In Example 27.2.37, using grl 
order, u>e constructed the Grobner basis 

h = y 3 - yx 

f 2 = y 2 x - x 2 + y 

h - -y 2 
h — ~y x 
h = -x 2 + y. 

We multiply fa, f^, and fa by —l to make the leading coefficients of the gen¬ 
erators equal to 1. Now LT(fa) = y 3 = -y-LT(fa). By Lemma 27.2.38 ; we can 
eliminate fa. Similarly, since LT(fa) = y 2 x — -y-LT(fa), we can eliminate 
fa. There are no m.ore cases, where the leading term, of a generator divides the 
leading term, of another generator: 

y 2 Xyx, y 2 /x 2 , yx/y 2 , yx fx 2 , x 2 fy 2 , x 2 fyx 


Hence, 


93 = y 2 , 9a : yx, 95 =x 2 -y 


is a minimal Grobner basis for I. Now y 2 + ayx, yx, x 2 — y is also a, minim,al 
Grobner basis for I, where a E K is any nonzero constant. Thus, for K infinite, 
there exists infinitely many Grobner bases. Hence, a Grobner basis for an ideal 
is not necessarily unique. 


Definition 27.2.41 Let I be an ideal in K[x \,..., x n \. Let G be a Grobner ba¬ 
sis for I. Then G is a reduced Grobner basis for I if the following conditions 
hold: 

(i) LC(g ) - 1 for all geG. 

(H) For all g E G, no monomial of g lies in { LT(G\{g })). 

In the above example, y 2 , yx, x 2 — y is a reduced Grobner basis for I : y 2 £ 
(yx, x 2 ) , yx £ (y 2 , x 2 ) , x 2 £ (y 2 , yx) , y (y 2 ,yx) . 

For the minimal Grobner basis y 2 + ayx, yx, x 2 — y, ayx E (yx, x 2 ) and so 
this basis is not reduced when a/0. 


Theorem 27.2.42 Let I be an ideal of K[x \,..., x n }. Then for a given mono¬ 
mial ordering, I has a unique reduced, Grobner basis. ■ 


We now consider the problem of solving polynomial equations. 

Theorem 27.2.43 Let I be an ideal of K\x\, x 2 ..., x n }. Let fi, fo • ■ ■, £ I 

be such that /=(/!,..., fm) ■ Then V(I) - V({f u ..., f m }). ■ 

Pit^Le. 7/laiAe»tatLca/ “PlLySfiLc-A. 
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Example 27.2.44 Consider the polynomial equations 

x 3 + y + z 2 — 0 

x 2 + z 2 = y (27.9) 

x — z. 


Let 

I = ^ x 3 4- y 4- z 2 , x 2 + z 2 — y, x — z^j C C[x,y,z\. 

By Theorem, 27.2.43, we can compute V(I) using any basis of I. Using lex order 
with x > y > z, we obtain the following Grobner basis: 

gi = x - z 

92 = V ~ 2z 2 

93 = z 3 4- 3z 2 . 

The polynomial 93 depends on z alone, and its roots are 

z = 0, — 3. 

Next, we can solve the equations gi = 0 and 92 = 0 uniquely for x and y, 

respectively, by substituting the value of z. Thus, the solutions of g\ — 92 = <73 

= 0 are (0,0,0) and (—3,18,—3). Since V(I) = V(<?i,p 2 >£ 3 )) we have found 
all solutions to Eqs. (27.9). 

In the above example, the variables are eliminated successively. Also, note 
that the order of elimination corresponds to the ordering of the variables. It 
does follow that lex order gives a Grobner basis that successively eliminates 
the variables. 


27.2.1 Worked-Out Exercises 

0 Exercise 1 Let / = (/i,/ 2 ), where f\ — xz — y 2 and f 2 = x 3 — z 2 in 
C[x,y,z\. Use grl order with x > y > z. Let / — —4a : 2 y 2 z 2 4- y^ 4- 3z 5 . 


(i) Show that {/ 1 , / 2 } is not a Grobner basis for I. 

(ii) Find a Grobner basis for I. 

(iii) Determine if f €. I. 

(iv) Show that g = xy — 5 z 2 4- x £ I. 

Solution: (i) multideg(/i) = (1,0,1) and multideg(/ 2 ) = (3,0,0). Thus, 
7 = (3,0,1) and so 

S{h,h) = -(xz- y 2 ) - ^(x 3 - z 2 ) = -x 2 y 2 + z 3 . 

XZ X 

LT(S'(/i,/ 2 )) = -a : 2 y 2 eLT(I) since ~x 2 y 2 + z 3 = x 2 /i - zf 2 G I. But 


-xV t <LT(A),LTf/ 2 )> = [xz,x 3 ) 


it^Le ma^/ic#«ia^uia/ “Plty-AlcS. 
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(ii) Let fi = xz - y 2 , f 2 = x 3 - z 2 , / 3 = x 2 y 2 - z 3 , / 4 = xi/ 4 - z 3 , and / 5 = 

y 6 — z 5 . Then G = {/i, / 3 , / 4 , /s} is a Grobner basis for /. G is a reduced 

Grobner basis. 

(iii) Divide / by G. We obtain / = 0/i + O /2 — 4z 2 / 3 + 0 / 4 + I /5 + 0. 

Since the remainder is zero, we have / £ I. 

(iv) LT(p) = xy £ (LT (G)) = {xz, x 3 , x 2 y 2 , xy A , y 6 ). Hence, g G ^ 0, so 
that g I. 

27.2.2 Exercises 

1 . Let / = 3 x 2 y 3 z — bx 4 yz 2 + 3 xy — 2x € R[x, y , z]. Use lex, graded lex, and 
reverse lex orderings for the following determinations, where x > y > z. 

(i) Find multideg(/). 

(ii) Find LC(/). 

(iii) Find LM(/). 

(iv) Find LT(/). 

2. Let / = x 3 y 3 +y 2 , fi = xy 2 -YxRy 2 , and f 2 =y 2 ~y~l£ Q[z> 2 /]- Use 
lex order with x > y to divide / by f\ and f 2 . 

3. Let /, fi, and f 2 be defined as in Exercise 2. Use lex order with x > y 
to divide / by f 2 and / 1 , i.e., reverse the role of fi and f 2 . Compare the 
remainder with the remainder obtained in Exercise 2. 

4. Let / = x 3 yz 2 — 2 xyz 2 and g = x 2 z — xy 2 z + xz. Compute S(f, g) using 
the lex ordering with x > y > z. 

5. Suppose that I = {x A y 2 , x 3 y 4 , x 2 y 5 ) . In Dickson’s lemma, determine J, 
vn, and Jq, ■ ■ ■ •> Jm —l- 

6 . In Example 27.2.37, show that 

(i) S{fi,h) F = -yx with F = {A, / 2 , / 3 }; 

(ii) S(fi,f 2 ) F = 5(/i,/ 3 ) f - 0 with F = {/ 1 , / 2 , / 3 J 4 }; 

(iii) 5(/2 ,/ 3 ) f = y-x 2 with F = {A, A, A, A}- 

7. Show that a relation >- on W n satisfies (iii) of Definition 27.2.1 if and 
only if every strictly decreasing sequence in W 71 terminates. 

8. Prove that lex ordering on W n is a monomial ordering. 

9. Prove Theorem 27.2.8. 

10. Prove Theorem 27.2.43. 

“Ph-ySlc-A. 
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11. Consider the polynomial equations 

x 2 — y = 0 

x 2 z — 0. 

Find a reduced Grobner basis for the ideal / — ( x 2 — y , x 2 z ) in R [x,y, z\. 
Compute V(I). 





Selected Bibliography 


1. Adams, W. W., and Loustaunau, P. An Introduction to Grobner Bases, 
Graduate Studies in Mathem,atics, Vol. 3. American Mathematical Soci¬ 
ety, 1994. 

2. Aschbacher, M. The classification of finite simple groups. Mathem,atics 
Intelligencer , 3(2), 59-65, 1981. 

3. Artin, E. Galois Theory. Notre Dame, Ind.: University of Notre Dame 
Press, 1944. 

4. Barnes, W. E. Introduction to Abstract Algebra. Boston: D.C. Heath and 
Company, 1963. 

5. Becker, T., and Weispfenning, V., In cooperation with Kredel, H. Grobner 
Bases, A Computational Approach to Commutative Algebra, Graduate 
text in mathematics. New York: Springer Verlag, 1993. 

6. Bell, E. T. Men of Mathematics. 2d ed. New York: Simon and Schuster, 
1962. 

7. Berlekamp, E. R. Algebraic Coding Theory. New York: McGraw-Hill, 
1968. 

8. Burton, D. M. Elementary Number Theory. Boston: Allyn Sz Bacon, 
1980. 

9. Cohn, P. M. Algebra, Vols. 1 and 2. New York: Wiley, 1974, 1977. 

10. Cox, D., Little, J. and O’Shea, D. Ideals, Varieties, and Algorithms. New 
York: Springer Verlag, 1992. 

11. Edwards, H. M. The genesis of ideal theory. Arch. History Exact Sci. 
23, 321-378, 1980. 

12. Edwards, H. M. Dedekind’s invention of ideals. In Studies in the His¬ 
tory of Mathematics, E.R. Phillips, ed. The Mathematical Association of 
America, 1987. 





603 


13. Fuchs, L. Infinite Abelian Groups, Vols. 1 and 2. New York: Academic 
Press, 1970,1973. 

14. Gillispie, C. C., ed. Dictionary of Scientific Biography, Vols. 1-14. New 
York: Charles Scribner’s Sons. 

15. Goodearl, K. R. Ring Theory: Nonsingular Rings and Modules. New 
York: Marcel Dekker, 1976. 

16. Halmos, P. R. Naive Set Theory. New York: Springer Verlag, 1974. 

17. Hardy, G. H., and Wright, E. M. An Introduction to the Theory of Num¬ 
bers, 4th ed. Oxford, England. Clarendon Press, 1960. 

18. Herstein, I. N. Topics in Algebra. 2d ed. New York: Wiley, 1975. 

19. Hungerford, T. W. Algebra. New York: Holt, Reinhart and Winston, 
1974. 

20. Isaacs, I. M. Algebra. California: Brooks/Cole, 1994. 

21. Jacobson, N. Basic Algebra, Vols. 1 and 2. San Francisco: Freeman, 
1974, 1980. 

22. Karpilovsky, G. Topics in Field Theory. New York: North-Holland, 1989. 

23. Kiernan, B. M. The development of Galois theory from Lagrange to Artin. 
Arch. History Exact Sci. 8, 40-154, 1971-1972. 

24. Kleiner, I. The evolution of group theory: A brief survey. Ma,them,atics 
Magazine 59(4), 195-215, 1986. 

25. Kleiner, I. A sketch of the evolution of (noncommutative) ring theory. 
L’Enseignem.ent Mathem,atique , 33, 227-267, 1987. 

26. McCoy, N. H. The Theory of Rings. New York, Chelsea Publishing Com¬ 
pany, 1973. 

27. Pless, V. Introduction to the Theory of Error-Correcting Codes. New 
York: Wiley-Interscience Series Discrete Mathematics, 1982. 

28. Rotman, J. J. An Introduction to the Theory of Groups. Iowa, Wm. C. 
Brown, 1988. 

29. Rotman, J. J. Galois Theory. New York: Springer Verlag, 1990. 

30. Van der Waerden, B. L. A History of Algebra. New York: Springer Verlag, 
1985. 



604 


31. Zariski, O., and Samuel, P. Commutative Algebra, Vol. 1. New Jersey: 
D. Van Nostrand Co. Inc., 1960. 





Answers and Hints to 
Selected Exercises 


Exercises 1.1.2 (page 6 ) (Sets) 

1. A\JB = { x,y,z,w}\ AflB = {y}; A\B = {x,z}; B\A = {u>}; Ax B - {(x,y), (x.w), 
(y,y), (y,w), ( z,w )}; V{A) = {ct>,{x},{y} ) {z},{x,y},{x,w},{y,z},A}. 

3. (i) x G A U B if and only if x € A or x G B if and only if x G B or x G A. Thus, 
A U B = B U A. Similarly, A Pi B = B fl A. 

4. |P(S)| = 4096. 4095 subsets are properly contained in S. 

6 . (i) A A B = (a, d, e}. 

7. (ii) Note that A = (- 4 \B) U (A fl B) and ( A\B ) n {A fl B) = 4>. Now use (i). 

8 . (i) True (ii) False (iii) False (iv) True (v) True. 

Exercises 1.2.2 (page 19) (Integers) 

1. gcd(90, 252) = 18, s = 3 and t = — 1. 

2 . s = 239 and t — —353. 

3. s — 22 and t = —15. 

4. (ii) Let S(n ) be the statement: 7 n — 1 is divisible by 6 for all n G Z # . For n = 0, 

7° — 1 = 0, which is divisible by 6. Hence, S( 0) is true. Suppose S(n ) is true for some n > 0. 

Consider S(n -f 1) : 7 n+1 — 1 is divisible by 6. Now 7 n+1 — 1 = 7 n ■ 7 — 1 = 7 T3 '(6 + 1) — 1 = 
7 n • 6 + 7 n — 1. Now 7 n ■ 6 is divisible by 6 and by the induction hypothesis 7 n — 1 is divisible 
by 6 . This implies that 7 n ■ 6 + 7 n — 1 is divisible by 6 and so 7 n+1 — 1 is divisible by 6 . Thus, 
S(n + 1) is true. Hence, by induction 7 n — 1 is divisible by 6 for all n G Z # . 

5. (i) Suppose a\b. Then b — an for some n G Z. Thus, be = a(cn) for some n G Z and 
for all c G Z. Thus, a| 6 c for all c G Z. 

(iii) Suppose a |6 and a\c. Then b = an and c = am for some n,m G Z. Let i, y G Z. Now 

bx + cy = a(nx + my). Since n,m, x t y G Z, nx + my G Z. Hence, a\(bx + cy). 

7. ab 7 ^ 0. Now c ~ at and d — bs for some s, t G Z. Hence, cd = abts. Thus, ab\cd. 

9. Suppose gcd(m, n) = c. There exist u,v G Z such that m = uc and n = vc. Thus, 
a = cdu and b = edv and so cd\a and cd\b. Since gcd(a, 6 ) = d, cd|d. Thus, d = cdk for some 
k G Z. This implies that 1 = ck, and since c is a positive integer, c = 1. Consequently, 
gcd(m, n) = 1 . 

11 . There exist u,v,t,s G Z such that 1 = xu + yv and 1 = xt + zs. Thus, zs = 1 — xt. 
Now zs — zs ■ 1 = zs(xu + yv). Therefore, 1 — xt = xzus + yzvs and so 1 = x(t + zus) + yzvs. 
Hence, by Theorem 1.2.11, gcd(z,yz) = 1. 

14. Let b = a + 1. Now 1 = (a4-1) • l + a(—1). Thus, by Theorem 1.2.11, gcd(a, a+ 1) = 1. 

20. (i) True (ii) True (iii) True (iv) True (v) True 

Exercises 1.3.2 (page 28) (Relations) 


“Public. 
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1. (i) R = {(1> 1)j (1) 5)> (2j 2), (2, 6), (3,3), (3,7), (4,4), (5,1), (5,5), (6, 2), (6,6), (7,3), 
(7,7)}. (n)V(R) = A. (iii)l(R) = A. (iv) R^ 1 — (1,1), (1,5), (2, 2), (2,6), (3,3), (3,7), 

(4.4) , (5,1), (5,5), (6,2), (6, 6), (7, 3), (7,7)}. (v) V(R~ l ) = A. (vi) 1(R~ 1 ) = A. 

2. (i) R = {(1,1), (1,2), (1,3), (1,4), (1,5), (1,6), (2,1), (2,2), (2,3), (2,4), (2,5), (2,6), 
(3,1), (3,2), (3,3), (3,4), (3,5), (3,6), (4,1), (4,2), (4,3), (4,4), (4,5), (5,1), (5,2), (5,3), 

(5.4) , (6,1), (6,2), (6,3)}. (ii) No (iii) Yes (iv) No. 

3. (i) Yes (ii) No (iii) No (iv) No (v) Yes (vi) Yes (vii) No. 

5. [1] = {1,4,7}, [2] = {2,5,8}, and [3] = {3,6}. 

6. V(R) = A and 1(R) = A. 

8. [-1] = [5] - [11] = [23] and [2] = [8]. 

10. x—y - = nk and z — w = nl for some k, l £ Z. Thus, (x+z) — (y+w) = (x—y) + (z—w) = 
nk + nl = n(k + l ). Thus, x + z = n y + w. Also, xz — yz — nkz and yz — yw = nly. Thus, 
xz — yw = (xz — yz) + (yz — yw) - nkz + nly = n(kz + ly) and so xz = n yw. 

16. (i) Let (a, 6), (b, c) £ R°°. Then (a, b) £ R n and (6, c) £ R m for some positive integers 
m and n. Let m > n. Then (a,b), (b,c) £ R m . Thus, ( a,c) £ T?™ -1-1 C R^. Hence, R°° is 
transitive. 

(ii) Let (a,b) £ R°°. Then ( a,b) £ R n for some positive integer n. Thus, there exist 
ai,a2, • • • ,a n -i £ A such that (a,ai), (ai,<22 ),..., (a n -2, a, n -i), (a n -i, b) £ R. Hence, (a, ai), 
(a-i, ct2), .. -, (a n - 2 ,a n -i), (a n -i,b) £ T. Since T is transitive, it follows that (a, b) £ T. Hence, 
jR°° C T. 

17. Let (x,y) £ R 2 o R 1 . Thus, there exists z £ S such that (x,z) £ R\ and (z,y) £ R 2 . 
Since R\ and R 2 are symmetric, (y , z) £ R 2 and (z, x) £ Ri. Thus, (y, x) £ R 10 R 2 C R 2 oRi. 
Hence, R 2 o Hi is symmetric. Now (y,x) £ R 2 o R x . As before, (x,y) £ Ri o R 2 . Hence, 
i?2 0 Ri Q Ri o Thus, Ri o R 2 — R 2 o i?i. 

19. (i) True (ii) False (iii) False (iv) True (v) True. 

Exercises 1.4.2 (page 38) (Partially Ordered Sets) 

2. Let A = {1, 2} and R = {(1, 2)}. 

5. Yes. 

7. (i) a < a V ( 6 V c) and 6Vc< aV ( bVc). Thus, a, b, c < a V ( bVc). Since a V b exists, 
aVb<aV(bVc). Therefore, a V ( b V c) is an upper bound of {a V b, c}. Let x be an upper 
bound of {a V b, c}. Then a, b < aV b < x and c < x. Since b V c exists, 6 V c < x. Again a V ( 
bV c) exists. Hence, aV( by c) < x. Consequently, a V ( b V c) is the least upper bound of 
{a V b, c} and so a V (b V c) = (aV b) V c. 

8. (i) is not a lattice, (ii) and (iii) are lattices. 

9. 4 A ( 8 V 10) = 4 and (2 V (2 A 8)) V 20 = 20. 

10. (i) a < a V b and a < a V c. Thus, a is a lower bound of {(a V b), (a V c)}. Hence, 
a < (a y b) A (a y c). Now b A c < b < aV b and feAc<c<aVc. Thus, b A c is a lower bound 
of {(a V b), (a V c)}. Hence, b Ac < (a V b) A (o V c). Thus, (a V 6) A (a V c) is an upper bound 
of {a, b A c}. Hence, a V (b A c) < (a V b) A (a V c). 

13. (i) False (ii) False (iii) True. 

Exercises 1.5.2 (page 50) (Functions) 

1. (i) / is one-one and onto, (ii) / is one-one and onto, (iii) / is neither one-one nor 
onto. 

2. f is neither one-one nor onto. 

3. (/ o g)(x) = \/3x + 1, (p o f)(x) = 3y*x + 1, and f °g ^ go f. 

4- (go f)(x) = 2+ 

5. (i) g(x) = x — 2 is a left inverse of /. (ii) Let g : Z —> Z be defined by g(x) = | if x is 
even and g(x) = 1 if x is odd. Then g is a left inverse of /. (iii) / has no left inverse. 

6. (i) Let h.(x) = x + 3 for all x £ Z. Then h is a right inverse of /. (ii) / does not have 
a right inverse, (iii) / has no right inverse. 
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7. f\ : 1 > 1,2 -» 2,3 - 3; /a : 1 -> 1,2 -> 3,3 -> 2; / 3 : 1 - 2,2 -> 1,3 - 3; 

/ 4 : 1 — 3,2 -> 2,3 -> 1; / 5 : 1 2, 2 — 3,3 — 1 ; / 6 : 1 — 3,2 — 1,3 -> 2. 

10. (i) x G /(A U B) if and only if x = f(u) for some u G A U B if and only if x = f(u) 

for some u G A or u G B if and only if x G /(A) or x G /(B) if and only if x G /(A) U /(B). 
Hence, /(AUB) = /(A)U/(B). 

11. (ii) x G / - 1 (AnB) if and only if /(x) G AflB if and only if /(x) G A and /(x) G B 

if and only if x G / _ 1 (A) and x G / _ 1 (B) if and only if x G f~ 1 (A ) D / - 1 (B). Hence, 

r i (inB) = r i (d)nr i (B). 

13. Define / : Z —> E by /(n) = 2n. Then / is one-one and onto E. Hence, Z ~ E. 

15. (i) Define / : Z —■+ 3Z by /(n) = 3n. Then / is one-one and onto 3Z. Hence, Z ~ 3Z. 
19. Yes. 

21. (i) False (ii) False. 

Exercises 1.6.2 (page 55) (Binary Operations) 

1. (i) No (ii) Yes (iii) Yes (iv) Yes (v) Yes (vi) Yes. 

2 . (ii) (iii), (iv), (v), (vi). 

3. (ii) and (iv) have identity. 

Exercises 2.1.2 (page 77) (Elementary Properties of Groups) 

1 . (i) (N, *) is a semigroup but not a group, (ii) (Z, *) is not a semigroup and so not a 
group, (iii) (R, *) is a semigroup but not a group, (iv) (R, *) is a group and so a semigroup, 
(v) (R, *) is a semigroup but not a group, (vi) (Q, *) is a semigroup but not a group, (vii) 
(G, *) is a group and so a semigroup, (viii) ( G , *) is a group and so a semigroup. 

2 . ( 0 , 1 ) is the identity and (—f, £) * s the i nverse of ( a,b ). 

5. n — 7. 

6 . [ 6 ] = [ 8 ] and [ 6 ] G Ug. 

7. n ~ 2. 

8 . Ue = {[ 1 ], [5]}; Us = {[ 1 ], | 2 ], [4], [5], [7], ( 8 ]}; U 12 = {[ 1 ], [5], [7], [ 11 ]}; [7 24 = {[ 1 ], 
[5], [7], [11], [13], [17], [19], [23]}. 

9. For all 0 < a < p, gcd(o,p) = 1. Thus, for all 0 < a < p, [a] G U p . Hence, 

u p = Zp\{[ 0 ]}. 

12. Note that a 2 = e implies that a = a 1 . By using a * b A * a = b 7 , first show that 
f 9 - 6 16 . 

17. Suppose G is commutative. Let a,b G G. Now (a * 6) _1 = {Y 1 * a -1 = a -1 * b~ l . 
Conversely, suppose (u * 6) -1 = a -1 * 6 -1 for all a,b G G. Let a, b G G. Then (a * b)~ l = 
a -1 * 6 _1 , which implies that ((a * 6) -1 ) -1 = (a -1 * fe -1 ) -1 , i.e., a * b = b * a. Hence, G is 
commutative. 

19. Let a, b G G. Suppose (a* b) 1 = a* *b l , (a*b) z+l = a l+1 *b l+1 , ( a * b) z+2 = a 1+2 *b z+2 . 
Now a 1+1 * £> l+1 = (a * b ) l+1 = (a * 6 )(a *b) 1 = a*b* a 1 *b l , which implies that a 1 *b — b*a l . 
Also, a l+2 *b l+2 = ( a*b ) l+2 = (a* 6 )(a*b ) l+1 = a* 6 *a l+1 which implies that a I+1 *b = 

b * a t+1 . Hence, b * a 1 "*" 1 = a 1+1 *b—a* a* *b=a*b*a x and this implies that b * a = a * b. 
Hence, G is commutative. 

21 . x is unique. 

22. Use induction on n. 

23. Consider the set {o n | n G N} C G. { a 71 | n G N} has finitely many elements. 

24. Use induction on n. 

26. Suppose |G| = n. Note that { e,a,a 2 ,... ,a n } C G and G has n elements. 

28. (ii) Suppose that a * b — ^ 2n + 3 * a 2n+1 , where n > 1 is an integer. Then o (6 * a -1 ) = 
o(6 2n+3 * a 2n_1 ) = o(6 2n+1 * a 2n+1 ). 

29. Use Theorem 2.1.28. 

31. 

m 

32. (ii) Use induction and part (i). 

“Pufuc. Mai/La##ia£lea/ 
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33. Use induction and Worked Out Exercise 4 (page 74). 

34. First note that (1,0) is the identity element of G and (£,—~) is the inverse of 
( a,b ) 6 G. Now proceed as in Worked Out Exercise 2 (page 72). 

36. Let b G S. Now {b , b 2 , . ..} C S. Since S is finite, there exist integers m and n, 
m > n such that b m - b n . Let k = m-n. Then b n+k = b n . Now b 2n+k = b n b n+k = b n b n - 
b 2n . By induction, b Tls+k = b ns for all positive integers s. Also, b ns+2k = b ns+k b k = b sn b k = 
yns+k _ j ,ns jj encej by induction b ns+kr = b ns for all positive integers s and r. Thus, b 2nk = 
b nk+nk = b nk Let a = b nk .Then a 2 = a. 

41. Use induction. 

42. (i) False (ii) False (iii) True (iv) True (v) False (vi) False. 


Exercises 3.1.2 (page 96) (Permutation Group) 

1. (i) (1 3 4) o (2 5 6 ) = (1 4) o (1 3) o (2 6 ) o (2 5). (ii) (1 3) o (4 5). 

2 . (5 4 6 ). 

3. (2 4 8 ) o (3 5 6 ). 

4. (2 16 7). 

5. (3 9 7) o (5 4 1 2). 

6. (1 2) o (5 6 ) o (7 8 ). 

7. Let qi = (1 6 ), a 2 = (1 5), 03 = (1 4), 04 = (1 3), 05 = (1 2). Let j3 = ai o ct 2 ° 03 o 
0 : 4005 . Show that (3(i) = a(i) for all i. 

8 . 6 . 

13. A 4 = {e, (1 2 3), (1 3 2 ), (2 3 4), (2 4 3), (1 3 4), (1 4 3), (1 2 4), (1 4 2 ), (1 2 ) o (3 
4), (14) o(3 2), (13)o(2 4)}. 

15. Let H be the set of all odd permutations in S n . Then S n = A n UH and A n OH = 0. 
Hence, |5 n | = |A™| + |H| . First show that there exists a one-one function from A n onto H. 
Therefore, \A n \ = |//| . Now use the fact that J5A| = n\ and |S n | = \A n \ + \H\ . 


Exercises 4.1.2 (page 107) (Subgroups) 


2 . (i) Since 


1 0 
0 1 


G S, S 7 ^ 0. Let 


a b 
c d 


g h 

U V 


€ S. Now 


9 h 
u v 


n -1 


v —h 
-u g 


. Thus, 


a b 


_ 1 

-1 

a b 


o 


U V 


"3 

a 

_ t 

> 


v —h 

-u g 


av — bu — ah + bg 
cv — du —ch + dg 


Since ( av — bu)(—ch + dg) — (—ah + bg)(cv — du) = 1 
S' is a subgroup. 

^ G S, S 7 ^ 0. Let 


a b 


1 - 

1 _ 

c d 


- 1 

1 


-1 


(iv) Since 


0 1 


a 

6 


0 

d 

J 


c 

0 


e 

/ 


G S. Hence, 

G S. Then ad ^ 0 and 


L 

Now 

j 



L 


j 

L 


J 

| 

a b 


c 

e 

-1 

a 

b 


1 

_e_ 

cf 


0 d 


0 

/ 

r 


0 

d 

~i 


0 

c 

cf 


c/ 

0 


C J 

dc 


cf 


Now 4^7 = 7 % ^ 0- Hence, 

<=/ cf (c/) 2 7- 1 


p-Q 

<3 

1 _ 


c e 

0 d 


_ ° / 


1 -1 


and so S' is a subgroup. 


4. Note that (0,1) is the identity element and (—f, r) is the inverse of (a, b) G G. 
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(ii) Since (0,1) G K, K ^ 0. Let (a, 6 ), ( c,d ) G K. Thus, 6 > 0 and d> 0 , which implies 
that ^ > 0. Hence, (a, 6 )(c, d ) -1 = (a, 6 )(— = (a— G AT. Thus, AT is a subgroup, 

(iv) Elements of order 2 are of the form, (a, —1), where a G R. 

5. No. ' 

6 . (4,6) = (2). 

7. (4,5) = Z. 

8 . (i) Let a = ^ ^ 2 1 ) and 6 = ( 2 1 4 3 ) ' Then ^ = 6 ’ 

(ii) (h.,v) = {r 360 , h , t», r 180 }. 

9. (i) o(a) = 4 and 0 ( 6 ) = 2. (iii) H = {e, a, a 2 , a 3 , 6 , 6 a, 6 a 2 , 6 a 3 }. (iv) |H| = 8 . 

10 . (ii) (a 2 6) 2 = a 2 ba 2 b — a(a 6 a)a 6 = abab = (a 6) 2 = e. 

13. Note that o( 6 a 6 _1 ) = o(a) for all 6 G 67. 

15. ea — a — ae. Thus, e G 67(a) and hence 67(a) 7 ^ 0. Let 6 ,c G 67(a). Now a 6 = 6 a 
implies that a 6 _1 = 6 a -1 and so 6 _1 G 67(a). Also, a( 6 c) = (a 6 )c = ( 6 a)c = 6 (ac) = 6 (ca) = 
( 6 c)a. Thus, 6 c G 67(a). Hence, 67(a) is a subgroup. Let x G Z(G). Then ax — xa for all a G 67. 
Thus, x G 67(a) for all a G 67 and so Z(G) C n a£ G67(a). Conversely, let x G n a 6c67(a). Then 
x G 67(a) for all a G G. Thus, xa — ax for all a G G, which implies that x G Z(G). Hence, 
Z(G) = n a&G C(a). 

18. ( H) = f]{K | AT is a subgroup of G such that H C AT} = H. 

21. (i) Note that (1 2 3 • • • n)o(i z+l)o(l 2 3 ■ ■ • n) -1 = (i+1 z + 2) for alH = 1,. . ., n — 2. 

22. Suppose Q =<^j-, ■ • ■, >, where gcd(pi,<ji) = 1 for all i. Let q = lcm(qi,< 72 , 

. . . , q n ). Then q = qiXi for some rt G Z, 1 < i < n. Now ^ = p^zy^ G . Hence, 

Q = (g) • Now ^ G Q. There exists k G Z such that ^ , which implies that ^ = A: G Z, 

a contradiction. Hence, (Q,+) is not finitely generated. 

23. Note that, if |G| = n, then \P{G)\ = 2 n . 

24. No. 

25. (i) True (ii) True (iii) False (iv) False (v) False (vi) False (vii) True. 


Exercises 4.2.2 (page 114) (Cyclic Groups) 

1 r,\ 5 \ _ „5 10 15 20 251 /„ 2 \ _ „2 4 „8 10 „12 14 „16 

1 . f i J (a ) — {e, a , a ,a , a , d j. ^ii j ( q. ) — a, a, a, a,a , , o , 

„18 _20 „22 24 _26 ,,281 

a , a , a ,a ,a ,a J. 

2. Two elements of order 6 and four elements of order 5. 

3. Use Worked Out Exercise 3 (page 114). 

4. (i) (Q,+) is a subgroup of (R,+). Every subgroup of a cyclic group is cyclic. Thus, 
since (Q, +) is not cyclic, (R, +) is not cyclic. 

7. Consider Z 2 x Z 2 . 

11. Let G be the set of all 2 x 2 nonsingular matrices over R. Let H be the cyclic 
’ -1 0 
0 -1 

Ise (iii) True (iv) False (v) False. 


subgroup generated by 
13. (i) True (ii) Fa 


Exercises 4.3.2 (page 125) (Lagrange’s Theorem) 

1. (i) The right cosets of H are H , H( 1 2), and H( 1 3). (ii) (e, (1 2)}. 

2. Let H = 6Z. The right cosets of H are H + 0, H + 1, H + 2, H + 3, H + 4, and H + 5. 


3. Write a = 


12 3 4 

2 14 3 


6 


12 3 4 

3 4 12 


and c = 


12 3 4 
4 3 2 1 


Since e G H, H 7^ 0. Now x 2 = e for all x G H. Thus, x _1 = x G H for all x G H. Also, 
a6 = 6a = c G H, ac = ca = b G H and be = c6 = a G H. Hence, H is a subgroup of 67. 

4. H, rgoH, r l80 H, T270H are the left cosets and H, Hrg 0 , Hr iso, Hr27o are the right 


cosets. 
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6 . ((1 2 3 4)) = {e, (1 2 3 4), (1 3) o (2 4), (1 4 3 2)}; ((1 3 2 4)) = {e, (1 3 2 4), (1 

2) o (3 4), (1 4 2 3)}; ((1 3 4 2)) = {e, (1 3 4 2), (1 4) o (2 3), (1 2 4 3)}; {e, (1 2), (3 4), 
(1 2) o (3 4)}; {e, (1 4), (3 2), (1 4) o (3 2)}; {e, (1 3), (2 4), (1 3) o (2 4)}; {e, (1 2) o (3 

4), (1 4)o(3 2), (1 3) o(2 4)}. 

7. 



d 

a 

b 

c 

d 

d 

a 

b 

c 

a 

a 

c 

d 

b 

b 

b 

d 

c 

a 

c 

c 

b 

a 

d 


11. Let G be the group of symmetries of the square and, H denote the subgroup { 7 - 360 , h}. 

14. Consider H = {a £ S n | a(l) = 1}- 

15. Consider \HK\ . 

16. Suppose that H and K are two subgroups of order p. Use previous exercise to 
conclude that H = K. 

19. 175 

20. Find \AB \. 

23. (i) False (ii) False (iii) False (iv) True (v) True (vi) True. 

Exercises 4.4.2 (page 136) (Normal Subgroup) 

1. H is a normal subgroup. 

2. H is not normal. 

5. For all h G H, Hh = H , and for all x H, Hx = Ha. Thus, H is of index 2 in G. 
Hence, H is a normal subgroup. 

7. Replace Z(G) by H in Worked-Out Exercise 4.4.1 (page 136). 

9. (i) {E, 1 + E}. (ii) Q/Z ={f + Z | 1 < a < b}. 

12. Show that hkh~ l k~ l & H fl K for all h G H and k G K. 

14. Let G = {±1, ± 7 , ±j, ±fc}, where i 2 = j 2 = k 2 = —1, ij — k = —ji, jk = i = —kj. 

ki = j = —ik. 

17. Note that |ai7o -1 | = \H\ for all a £ G. 

19. Let hT be a subgroup of order 6. Since A\ has eight 3-cycles and \H\ = 6, there exists 

a 3-cycle, a say, such that a ^ H. Then a 2 = aC 1 ^ H. Let K = {e, a, a 2 }. Then K is a 
subgroup of A 4 such that \K\ = 3 and HC\K = {e}. Thus, \HK\ = = 6-3 = 18 > |A 4 |. 

a contradiction. Hence, A 4 has no subgroup of order 6. 

20. {<>}, {e, (1 2) o (3 4)}, {e, (1 4) o (3 2)}, {e, (1 3) o (2 4)}. {(1 2 3)). ((1 3 4)). 

((1 2 4)), ((2 3 4)), {e, (1 2) o (3 4), (1 4) o (3 2). (1 3) o (2 4}, and At. 

25. (i) True (ii) True (iii) False (iv) False (v) True. 


Exercises 5.1.2 (page 151) (Homomorphism) 

1. (i) / is a homomorphism. Ker / = {1}. (ii) / is a homomorphism. Ker / = {0}- 
(iii) / is a homomorphism. Ker / = {1, —1}- (iv) / is not a homomorphism, (v) / is a 
homomorphism. Ker / = {0}. 

2. There are two homomorphisms from Z onto Z. One is the identity homomorphism 
and the other maps 1 to —1. 

3. There are two homomorphisms from Z onto Z6- 

4. There are four homomorphisms from Zs to Z 12 and there are 10 homomorphisms 


from Z 20 to Z 10 . 

6. Suppose that (Q, +) ~ (R, +) and let / : Q —» R be an isomorphism. Then /(0) = 0. 
Let 0 * J 6 Q. Then /(f) =/(J+ £ + ■•■ +J) = /(J) + /(f) + ■■■ + /(£) = p/(J)• Nov, 
/(!) = /(f) = P/(f)• Hence, f(±) = -/(1). Thus, /(J) = 2/(1). Now 1 € R. Since / is onto 
there exists ~ E Q such that 1 = /(^). If m = 0, then 1 = f(^) = /(0) = 0, which is a 
contradiction. Hence, m. ^ 0. This implies that 1 = /(“) = 17 /( 1 ) and so /(1) = ~ € Q 

~PlOne. T4twLtit.f^Jf±€ljtLCjtLL 
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Hence, /( £ ) E Q for all 2 6 Q. Thus, irrational numbers have no preimage. Consequently, 

(Q. +) 9^ (R-i +)• 

10 . Consider the function /(a) = log for all a E G. 

11. (i) Ker / = {e, a 4 }, (ii) Ker / = {e, a 2 , a 4 , a 6 }. 

15. Ker / = {e} x H. 

19. Define ip : A —* B by ip{a) = /(a). Show that ip is one-one and onto B. 

21 . (i) True (ii) True (iii) False (iv) False (v) False (vi) True (vii) False. 

Exercises 5.2.2 (page 164) (Isomorphism and Correspondence Theorems) 

1. Define / : R* —► R + by /(x) = x 2 for all x E R*. Show that / is an epimorphism 
and Ker / = {1, —1}. 

3. Define / : 8 Z —> Z 7 by f(t ) = [t] for all t E 8 Z. Show that / is an epimorphism and 
Ker / - 56Z. 

4. Let G = (Z, +); A = 2Z and B — 4Z. 

7. The correspondence is given by Z(G ) —+ {e}, { 7 - 360 , nso, 6,, v } —* {e, 6}, { 7 - 360 , 7 - 90 , 
7*180, t*27o) —7 {e,a}, {r 36 o, 7*180, ^ 1 ,^ 2 } —> {e,c}, and G —> G\. 

12. The subcollections of isomorphic groups are {Z 2 , ^ 2 }, {Z6}, {*?6}, {(Z,+), (17Z,+), 
(3Z,+), (<i > , )}, {(Q,+)}, {(R,+),(R+. )}, {(Q*. •)>. {(C*,.•)}, and {(R',-)}. 

13. Use Worked-Out Exercise 7 (page 163). 

14. |Aut(Ze)| = 2. One automorphism is the identity mapping and other mapping [l] 
onto [5]. 

16. Now Z(S 3 ) = {e}. Since Sz/Z^Ss) ~ Inn(S , 3 ), Inn(5a) — S 3 . Hence, 6 = |^ 3 1 = 
|Inn(5s)| < |Aut( 53 )|- Now S 3 = (a, 0) , where a = (1 2 ) and f3 = (1 2 3). Let / G Aut( 53 ). 
Then / is determined if f(a) and f((3) are determined. Now o(/(a)) = o(a) = 2 and 
°(/(/?)) = o(/3) = 3. Since S 3 has three elements of order 2 and two elements of order 3, /(ct) 
has three choices and /(/3) has two choices. This shows that Aut(S > 3 ) has at most six elements. 
Since 6 < |Aut(5s)| , 6 = |Aut(S , 3 )|. Hence, Inn(i> 3 ) = Aut(Ss) and so Inn^) ~ 5 3 ~ 
Aut(S r 3 ). 

17. Inn(^) ~ 1 S 4 ~Aut(^). 

20. (ii) Consider G = Z 2 x Z 2 and H = {([0], [0]), ([1], [0])}. 

22. (i) True (ii) False (iii) True (iv) False (v) True (vi) True (vii) False 

Exercises 5.3.2 (page 171) (The Groups D 4 and Q&) 

1. Let H = {e , b, a 2 , 6a 2 } and K = {e, 6}. 

3. The homomorphic images are Da, Zo, Z 2 , and K 4 . 

Exercises 5.4.2 (page 178) (Group Actions) 

1. G l = {e, (2 3)}, C 2 = {e, (1 3)}, G 3 = {e, (1 2)}. 

8. Use Worked-Out Exercise 5 (page 177). 

9. Use Worked-Out Exercise 5 (page 177). 

• 11. Use Worked-Out Exercise 4 (page 177). 

12. Use Corollary 5.4.10. 

Exercises 6.1.2 (page 187) (Direct Product of Groups) 

3. Let x G G. Then there exists unique hi E Hi, 1 < i < n, such that x = 6 , 16.2 • • ■ h n . 
Define by /(x) = ( hiKi , 6 , 2 K 2 , ..., h n K n )- It is easy to verify 

that f is an epimorphism and Ker f = K. 

5. Define / : G —> H as follows: Let x G G. Then there exists unique h G H and 
k G K such that x — hk. Define /(x) = h. Clearly / is an epimorphism and Ker/ = K. Thus, 

~ H. Similarly, ^ ~ K. 

7. Use Worked-Out Exercise 6 (page 187). 
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8. Show that the mapping / : G —> ^ X ^ defined by /(a) = (Nh , Nk), where a = hk, 
is an epimorphism with kernel N. 

11 . Note that if C 4 is a cyclic group of order 4 and C 2 is a cyclic group of order 2, then 

C4 has an element of order 4 while C 2 x C 2 has no element of order 4. 

13. Yes. 

15. Suppose H and K are proper subgroups of D 4 such that D 4 = H x K. Then H 

and K are normal subgroups of D 4 and either \H\ = 4 and \K\ = 2 or \H\ = 2 and \K\ = 4. 

Suppose \H\ — 4 and \K\ = 2. Then H and K are commutative. Also, hk = kh. for all h & H 
and k 6 K. Now it follows that D 4 is commutative, a contradiction. 

16. No. 

17. Suppose Z is an internal direct product of its nontrivial subgroups H and K. If 
H = (n) and K = (m), then show that mn € H fl K. 

Exercises 7.1.2 (page 195) (Conjugacy Classes) 

5. H is normal in K if and only if xHx ~ 1 = H for all x £ K if and only if x E Ng(H) 
for all x G K if and only if K C Ng{H). 

7. Let a be a 5-cycle, f3 be a 4-cycle, 7 be a 3-cycle, <7 be a 2-cycle, 6 be a product of 
a 3-cycle and a 2-cycle and /r be a product of two 2-cycles. The conjugate classes are Cl(e), 
Cl(a), Cl(p), Cl(rr), 01(a), Cl(S), and Cl(/i). Also,|Ci(e)| = 1, \Cl(a)\ = 24, \Cl(p)\ = 30, 
\Cl(j)\ = 20, \CI(cj)\ = 10, \Cl( 8 )| = 20, and |C/( 7 )I = 15. Now [S 5 : C(a)] = \Cl(a)\ for all 
a G £ 5 . Hence, 

\S s \ - [5 5 : C(e)\ + [5s : C(a)] + [5 5 : C(J3)\ + [S 5 : C( 7 )] + 

+[5 S : C(a)\ + [5s : C(6)} + [5s : C(fi)} 

= |CZ(e) | + \Cl(a)\ + \Cl(P)\ + |CZ( 7 )| + \Cl(a)\ + \Cl( 8 )\ + |C/( 7 )| 

= 1 + 24 + 30 -|- 20 + 10 + 20 -(- 15 

= 120 . 


Exercises 7.2.2 (page 200) (Cauchy’s Theorem and p-groups) 

2. 6. 

4. Use induction on n. 

5. The 2-subgroups of Z 12 are {[0], [6]} and {[0], [3], [6], [9]}. {[0], [4], [8]} is the only 
3-subgroup of Zi 2 - 

6. ((1 2) o (3 4)), ((1 3) o (2 4)), ((14) o (23)), and {e, (1 2)o(3 4), (1 3)o(2 4), (1 4)o(2 
3)}- 

8. Use Cauchy’s theorem and Worked-Out Exercise 5 (page 177) 

12. Use Worked-Out Exercise 5 (page 199). 

13. (i) By Cauchy’s theorem, G has a subgroup of order 11, say H. Suppose K is any 

other subgroup of G of order 11. Suppose H ^ K. It follows that \H D K\ = 1. Hence, 
\HK\ = | = 11 1 11 = 121 > |G| , a contradiction. Hence, H = K and so H is unique. 

Since H is the only subgroup of order 11, if is a normal subgroup of G. 

(ii) Since |if| is prime, H is cyclic. Let H = (a) for some a 6 H. Let g E G. Then 
gag ~ 1 € H. Thus, gag _1 = a 1 for some i, 0 < i < 11. Clearly 0. We claim that 2 = 1. Now 
g 2 ag ~ 2 = g(gag~ 1 )g ~ 1 = ga l g ~ 1 = ( gag _1 ) t = (a 1 ) 1 = a * 2 . By induction, g r ag~ r = a^ 

n n .3 > 3 1 

Now G contains an element of order 3, say b. Then a = b ab~ = a 1 . Hence, a 1 _ = e. 
Since o(a) = 11, i 3 =n 1. By Fermat’s theorem, 2 10 =n 1. Thus, i =n 1. Since 1 < i < 10. 
we must have 2 = 1. Thus, gag~ l = a, i.e., ga = ag. Hence, H C Z(G ). 

Exercises 7.3.2 (page 209) (Sylow Theorems) 

2. Use induction on n. 

4. First show that if \G/Z(G)\ = 91, then G/Z(G ) is cyclic. 
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8 . Let x E G. Then xPx~ 1 C xHx~ l = H. Hence, xPx~ l is a Sylow p-subgroup of H. 
There exists h E H such that hxPx~ l h _1 = P. This implies that h.x E Nc(P)- Thus, hx — y 
for some y E Ng(P)> which implies that x = h~ 1 g E HNg(P)- Hence, G = HNg(P)- 

10. Since K is a p-subgroup there exists a Sylow p-subgroup P of G such that K C P. 
Let Q be a Sylow p-subgroup of G. Then Q = xPx~ x for some x E G. Since K is a normal 
subgroup, K = xKx~ l . Hence, K = xKx~ x C xPx~ x = Q. 

12 . Let K be a maximal normal subgroup of G such that K C H, (possibly K = {e}). 
Then \K\ = p r such that r < m. Now | ^ | = p m-r = p n , n > 0. Then Z(j^) is a nontrivial 
subgroup of Since Z(j^) is a normal subgroup of G, Z(j^) = , where L is a normal 

subgroup of G such that K C L. Since > 1, K ^ L. Since K is a maximal normal 

subgroup of G such that K C. H, L ^ H. Let a E L be such that a £ H. We now show that 
aHa~ l = H. Let b E aHa~ l . Then b = aha -1 for some h, E H. Since Ka E L/K = Z(j^), 
KaKh = KhKa. Thus, aha~ 1 h ~ 1 = ( ah){ah)~ x E K C H. Hence, aha ~ 1 = aha~ 1 h~ l h, E H. 
Thus, aHa* 1 C H. Since \aHa~ l \ = \H\ , aifa " 1 = H. 

Exercises 7.4.2 (page 219) (Some Applications of Sylow Theorems) 

1. For order 20, show that the group has a unique Sylow 5-subgroup. For order 28, show 
that the group has a unique Sylow 7-subgroup. For order 36, show that the group either has 
a unique Sylow 3-subgroup or a normal subgroup of order 3. 

2. Use Sylow’s first theorem and Worked-Out Exercise 5 (page 199). 

6 . As in Worked-Out Exercise 2 (page 217), show that either G has a normal subgroup 
of order 32 or a normal subgroup of order 16. 

7. Use Corollary 7.4.12. 

9. The number of Sylow 5-subgroups is 1 + 5k such that 1 + 5&|7 ■ 19. Thus, k = 0 
and hence, G has a unique Sylow 5-subgroup, H say. Thus, H is normal. Similarly, G has 
a unique Sylow 7-subgroup, K say, and a unique Sylow 19-subgroup, L say. Thus, K and L 
are normal subgroups. Clearly H, K and L are cyclic groups, H fl K = {e}, K 0 L — {e}, 
and L fl H = {e}. Let H = (h) , K = ( k) and L = (l) . Since hk = kh,, hi = Ih and kl = Ik , 
o (hkl) = o(h) o (Jfe) o (0 = 5 • 7 ■ 19 = |G|. Thus, G = (hkl). 

15. (i) The number of Sylow 7-subgroups of G is 1 + 7k such that 1 + 7A124. Thus, k = 0 
or 1. If k = 0, then G has a unique Sylow 7-subgroup, which must be normal. This is a 
contradiction since G is simple. Thus, k — 1 . Then G has eight Sylow 7-subgroups. 

(iii) Let AT be a subgroup of order 14 in G. Now \K\ = 14 = 2 • 7. The number of Sylow 
7-subgroups of K is 1 4 - 7fc such that 1 + 7k\2. Thus, k — 0. Hence, K has a unique Sylow 
7-subgroup, say, P, which is normal in K. Now P is also a Sylow 7-subgroup of G. Since for 
all a E K, aPa~ 1 = P, K C Ng(P)■ This implies that 14 |]Ag(P)| — 21, a contradiction. 
Hence, G has no subgroup of order 14. 

18. Let G be a group of order 70. Let H be a Sylow 7-subgroup of G and A be a Sylow 
5-subgroup of G. Then H and K are unique and hence normal. Also, H, K, and HK are 
cyclic subgroups. HK is a normal subgroup of G, |H| = 7, \K\ = 5, and \HK\ = 35. Let 
HK = (a) Then o(a) = 35. Let b E G and o(£>) = 2. Now b~ l ab E HK. Thus, b~ 1 ab = a T 
for some r, 1 < r < 34. Now G = HK U bHK. From this, it follows that every element of 
G is of the form b s a r , where s = 0 or 1 and 0 < r < 34. Now b~ l ab — a T implies that 
o(a r ) = o(a) = 35. Thus, gcd(r, 35) = 1. Now a = b(b~ 1 ab)b~ 1 = a T . Therefore, 35](r 2 — 1). 
Hence, 5|(r 2 — 1 ) and 7|(r 2 — 1), 1 < r < 34. Now it follows that the only possible choices of 
r satisfying the above conditions are r — 1 , 6 , 29, 34. Thus, there are four groups of order 70. 

Case 1 : r = 1 . Then 6 _ 1 ai> = a, i.e., ab = ba. In this case, G is commutative and it is 
easy to verify that G ~ Z 70 . 

Case 2 : r = 34. Then b~ 1 ab = a 34 = a^ 1 , i.e., ab = ba~ l . Thus, G = (a, b) such that 
a 30 — e — b 2 and ab = ba ~ 1 . Hence, in this case G ~ P> 35 - 

Case 3: r = 6 . Then b -1 ab = a 6 . Thus, ab = ba 6 . Clearly o (a k ) 7 ^ 2 for all k, 0 < k < 34. 
Let ba k be an element of order 2. Then ( ba k ) 2 = e implies that a 7k = e. Hence, 35|7&, which 
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implies that 5|fc, 0 < k < 34. Thus, k = 0, 5, 10, 15, 20, 25, 30. Hence, elements of order 2 
in G are b, ba 5 , ba 10 , ba 15 , 6 a 20 , ba 25 , ba 30 . Let u — a 5 and S — (u, b) . Then u 7 — e = b 2 , 

b~ 1 ub = ba 5 b — a 30 — u G = it -1 . From this, it follows that 5 ~ D 7 . Now 

fci 5£ — k 1 6k 5 t — k i 5(fc+t) _ n 

a ba a = ba a = ba v ' 6 b 

and 

ba k ba 5 t (ba k )~ 1 = bba 6 k a 5t ~ k b = ba 30(k+t) G 5. 

Also, a k a 5 t a~ k = a 5t G 5 and 6 a fc a 5 t ( 6 a fc ) -1 = ba k a 5t ~ k b = a 30t G 5. Hence, 5 is a normal 

subgroup of G. Let v = a 7 and T = (v). Then T is a subgroup of G an,d T ~ Z 5 . Now T 

is a normal subgroup of G (since T is a Sylow 5-subgroup of G). Clearly S f)T = {e} and 

|G| = \ST\ . Thus, S and T are normal subgroups oi G, S<1T = {e}, and |G| = |5T| . Hence, 

G = 5 x T ~ I >7 x Z 5 . 

Case 4. r = 29. As in Case 3, we can show that G — D 5 x Z 7 . 

21. Let G be a group of order 14. Then G is cyclic or G ~ D 7 . 

23. If n is odd, then Z{D n ) — {e}, and if n is even, then Z(D n ) = { e , r? }. 

24. The conjugacy classes in L> 2 n+i are 

{e}, {b, ba, ba 2 ,.. ., ba 2n }, {a r , a~ r }, 1 <t <n. 

The conjugacy classes in D 2 n are 

{e}, { 6 ,faa 2 , ..., ba 2n }, {ba, ba 3 , ..., 6 a 2n-1 }, 

{a r , a -7- }, 1 < r < n — 1 , {a n }. 

25. (ii) Suppose n p = q 2 . Then pk + 1 = q 2 for some A: > 0. This implies p\(q 2 — 1). Thus, 
either p\(q + 1) or p\(q — 1). Since p > q, p / (q — 1). Hence, p\(q + 1). Thus, q + 1 > p > q. 
This implies that p — 3 and q = 2. 

29. (i) False (ii) True (iii) True (iv) False. 


Exercises 8.1.2 (page 237) (Solvable Groups) 

1. Let Hi = {riso, ^ 360 , h,v}, H 2 = {r 3 60 ,h}, H 3 — {riso, r- 3 6o, d 2 } and Ha = 

{r 3 6o,di}. Note that \G/Hi\ = 2 = \Hi/H 2 \ - \G/H 3 \ = \H 3 /H A \ • 

2. Let H — (66) . The composition series is 

Z/H D 2Z /H 3 6Z /H 3 66Z /H = {H}, 

Z /H 3 2Z/H 3 22Z/H 3 66Z /H = {H}, 

Z/H 3 3Z/H 3 6 Z/H 3 66Z /H = {H}, 

Z/H 3 3Z /H 3 33Z /H 3 66Z /H = {H}, 

Z/H 3 11Z/H 3 22Z/H 3 66Z /H = {H}, 

Z/H 3 11 Z/H 3 33Z /H 3 66Z /H = {H}. 

4. (i) S 3 3 A 3 3 {e}. 

(ii) Let K = {e, (1 2) o (3 4), (1 3) o (2 4), (1 4) o (2 3)}, H x = {e, (1 2) o (3 4)}, H 2 = {e, 
(1 3) o (2 4)} and H 3 — {e, (1 4) o (2 3)}. The composition series of A 4 is 

I 4 3if3//i3{ej, 

A 4 3 K 3 H 2 3 {e}, 

A 4 3 K 3 H 3 3 {e}. 

(iii) Let K, Hi, H 2 and H 3 be as in (ii). The composition series of S 4 is 


S 4 3 A 4 3 K 3 Hi 3 {e}, 
S 4 3 A 4 3 K 3 H 2 3 {e}, 
S 4 3 A 4 3 AT. 


~Pu.fi.CL 
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(iv) The composition series of Z 2 x Z 2 is 

Z 2 x Z 2 3 Z 2 x {[0]} 3 {([0], [0])}, 
Z 2 xZ 2 D{[0}}xZ 2 D{([0],[0])}, 

Z 2 xZ 2 3{([0],[0]), ([1M1])}d{([0],[0])}. 

5. Let G be a finite group. Since G is finite, there exists a maximal normal subgroup G 1 
of G. Thus, G/Gi is simple. If G 1 7 ^ {e}, then since G 1 is finite, there exists a maximal normal 
subgroup G 2 of G\. Then G\jG r i is simple. If G 2 7 ^ {e}, then continuing as before, we obtain 
the following series G = Go 3 Gi 3 • • - 3 G n 3 ■ • • such that Gi/Gi+i is simple for all i. Since 
G is finite, there exists n > 0 such that G n = {e}. Thus, G = Go3Gi3---3 G n — {e} is 
a composition series. 

12. If p = q, then G is a p-group and so solvable. Suppose p ^ q. Show that G has a 
unique Sylow p-subgroup or a unique Sylow p-subgroup. Let H be a Sylow p-subgroup and K 
be a Sylow g-subgroup. Then |if| = p 2 and \K\ = q. Clearly both H and K are commutative 
and hence solvable. Suppose H is normal. Then |^-| = q. This implies that ~ is a cyclic 
group and hence solvable. Since H and jj are solvable, G is solvable. Now suppose K is 
normal. Then | -p- j = p 2 . This implies that ^ is a commutative group and hence solvable. 
Since K and ^ are solvable, G is solvable. 

14. A solvable series for S 3 x S 3 is S 3 x S 3 3 S 3 x A 3 3 A 3 x A 3 3 A 3 x {e} 3 {e} x {e}. 

15. If G = {e}, then the result is trivially true. Suppose G 7 ^ [e}. Suppose G is not 
commutative. Then G' 7 ^ {e}. Since G' is a normal subgroup of G and G is simple, G — G'. 
Thus, G ^ = G ^ {e} for all positive integers n. However, since G is solvable, there must 
exist a positive integer n such that G ^ = {e}, a contradiction. Hence, G is commutative. 

18. Suppose G is solvable and H 7 ^ {e} is a subgroup of G. Then H is solvable and hence 
H' 7 ^ H by Worked-Out Exercise 2 (page 234). Conversely, suppose that H' 7 ^ H for any 
subgroup H ^ {e} of G. Then G / G'. Thus, G D G'. If G {n) ^ {e}, then G {n) ^ G (n+1) , 

i.e., G {n) 3 G (n+1 L Hence, we have the following strictly descending chain of subgroups: 

G 3 G' 3 • • • 3 G (n) 3 G (n+1) 3 • • • . 

Since G is finite and H' 7 ^ H for any subgroup H 7 ^ {e} of G, there must exist a positive 
integer n such that G ^ = {e}. Hence, G is solvable. 

21 . Note that ^ ~ Tnfi- 

22. (i) False (ii) False (iii) True (iv) False (v) False (vi) False (vii) False. 

Exercises 8.2.2 (page 244) (Nilpotent Groups) 

3. D n = (a, 6 ) , where o(a) = n, 0 ( 6 ) = 2 , and ab — 6 a _1 . Suppose D n is nilpotent. 
Also, suppose n = 2 m k, where k is an odd integer. Let u = a 2 . Then o(u) = k. Now 
ub = a 2 b = ba~ 2 = 6 u _1 . Let H = (u, b) . Then H ~ Dk . Now \H\ = 2 k and H is 
nilpotent. Let K ~ {e, 6 }. Then K is a Sylow 2-subgroup of H. Since H is nilpotent, K is 
normal. But then ubu = bu~ 2 £ K, a contradiction. Hence, n = 2 m for some positive 
integer m. 

5. No. 


Exercises 9.1.2 (page 257) (Finite Abelian Groups) 

2. Order 9 : Z9 and Z3 © Z3. 

Order 16 : Z16, Ze © Z 2 , Z4 © Z4, Z4 © Z 2 © Z 2 , and Z 2 © Z 2 ffi Z 2 © Z 2 . 

Order 27 : Z 2 7j Z9 © Z3, and Z3 © Z3 © Z3. 

Order 32 : Z32, Zi6 © Z 2 , Zg © Z4, Zg © Z 2 © Z 2 , Z4 © Z4 © Z 2 , Z4 © Z 2 © Z 2 ffi Z 2 , and 

Z 2 © Z 2 ffi Z 2 ffi Z 2 © z 2 . 

3. Order 15 : Zis- 

4. Order 60 : Z4 ffi Z3 ffi Z5 and Z 2 ffi Z 2 ^^Z3 © Z5. 
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Order 80 ■ ZieffiZ 5 , ZgffiZ 2 ©Z 5 , Z 4 ©Z 4 ©Zs, Z 4 ©Z 2 ©Z 2 ©Z 5 , and Z 2 ©Z 2 ffiZ 2 ©Z 2 ffiZ 5 . 
Order 240 : Zi 6 ©Z 3 ©Z 5 , Z 8 ©Z 2 ©Z 3 ©Z 5 , Z4©Z4©Z3©Z5, Z 4 © Z 2 © Z 2 © Z 3 © Z 5 , 
and Z 2 ® Z 2 © Z 2 © Z 2 © Z 3 © Z 5 . 

Order 540 : Z4 © Z27 © Z5, Z2 © Z2 © Z27 © Z5, Z4 © Z9 © Z3 © Z5, Z2 ffi Z2 © Z9 ffi Z3 ffi Z5, 
Z4 © Z3 © Z3 © Z3 © Z5, and Z2 © Z2 © Z3 © Z3 © Z3 © Z5. 

(i) 2 2 ,2 3 , 2 4 ,3,3 2 . (ii) 2,2,2 3 ,3,3,5,5,5. 

10. Zp3 © Zg2 , Zp2 © Zp © Zg2 , Zp © Zp © Zp © Zg2 , Z p 3 © Zg ® Zg, Z p 2 © Zp © Zg © Zg 
and Zp © Zp © Zp ffi Zg © Zg, 

11. Z 4 ffi Z 2 © Zg and Z 4 ffi Z 2 ffi Zg © Z 3 . 

15. Only two elements of order 3. 

IT. Zei and Z 27 © Z 3 . 

18. (i) True (ii) False (iii) True. 

Exercises 9.2.2 (page 267) (Finitely Generated Abelian Groups) 

3. |T(G)| = 252. 

4. The torsion coefficients are 30 and 60 and the betti number is 2. 

5. The elementary divisors are 2, 2 4 ,3, 3, 5,11; d\ — 6 and c /2 = 2640. 

7. (i) No (ii) No. 

13. (i) False (ii) False (iii) False. 


Exercises 10.1.2 (page 282) (Elementary Properties) 

I. (i) [1], [3], [5], and [7] are the units of Zs. [1] and [5] are the units of Z 6 - (ii) [2], [4], 
and [ 6 ] are the nilpotent elements of Zs. Z 6 has no nonzero nilpotent elements, (iii) The zero 
divisors in Zg are [2], [4], and [ 6 ]. The zero divisors in Z 6 are [2], [3], and [4]. 

3. (i) a + ( — l)a = 1 ■ a + (—l)a = (1 + (—l))a = 0 • a = 0. Hence, (—a) + (a + (—l)a) = 
0 + (—a) = —a. This implies that ((— a) + a) + (— l)o = — a, i.e., 0 + (— l)a = —a. Hence, 
(—l)a — —a. Similarly, a( —1) = —a. Now (—1)(—1) = —(—1) — 1 since 1 and —1 are additive 
inverses of each other. 

5. Suppose R is commutative. Let a,b € R. Then (a + b)(a — b) = a(a — b) + b(a — b) = 
aa — ab + ba — bb = a 2 — ab + ab — b 2 (since ab = ba) = a 2 — b 2 . Conversely, suppose that 
a 2 — b 2 — (a + b)(a — b ) for all a,b G R. Let a,b G R. Now (a + b)(a — b) = a 2 — ab + ab — b 2 . 
Hence, a 2 — ab + ba — b 2 = a 2 — b 2 and this implies that — ab + ba = 0 and so ba ~ ab. 

7. Use induction on n. 

8 . Use induction. 

9. Use Exercise 7 (page 282) and the fact that (^)x = 0 for all x G R and for all 
1 < r < p — 1 since the characteristic of R is p. 

II. Suppose a is nilpotent. There exists a positive integer n such that a n = 0. Now 


(1 — d)(l + a + • - * + & n ) = 1 + a + ■ ■ ■ + a 71 — a — a — ■ • • — a 71 — a 71 = 1 — a 71 = 1 — 0 = 1. 

16. Since gcd (m,n) = 1, there exist integers r and s such that 1 = mr + ns. Thus, 
= (a m ) r (a n ) s = (b m ) r (b n ) s = b mr b ns = b mr+ns = b. 


~ _mr „ns 

a — a ~ a a 


20. Note that 


if and only if n 


False (x) False. 


1 

0 


0 

1 


is the identity of M 2 (R) and for any positive integer n, n -1 = 0 


1 

0 


' 0 

0 ~ 

0 

1 


0 

0 

e (ii) 

Fal 

se (iii) False 


. Hence, R and M 2 (R) have the same characteristic, 
se (iii) False (iv) True (v) False (vi) True (vii) False (viii) False (ix) 


Exercises 10.2.3 (page 287) (Some Important Rings) 

3. Every nonzero nonunit is a zero divisor. 

5. Let R be a regular ring with 1. Let a G R and a/0. Suppose a is not a zero divisor. 
There exists b £ R such that aba = a. This implies that a(ba — 1) = 0. Since a is not a zero 

riLhjC- T^oiAoftai^Lca/ 
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divisor, 6a — 1 = 0 and so ba = 1. Similarly, ab = 1. Hence, a is a unit. 


Exercises 11.1.2 (page 293) (Subrings and Subfields) 


1. (i) Clearly T\ ^ 0. Let 


a b 
0 c 


a b 
0 c 


d e 
0 / 


G T\. Now 


d e 
0 / 


a — d b — e 
0 c-f 


a b d e ad ae + bf 

and „ _ , = <• £11. Hence, is a subrmg. 

0 c 0 / 0 cf 

2. (i) No (ii) No (iii) Yes. 

4. Clearly T ^ <p. Let nl,ml G T. Then nl — ml = (n — m) 1 G T and (nl)(ml) = 
nml G T. Hence, T is a subring. 

7. To show that Z[\/2] is not a subfield of R. Assume (y/2)~ l G Z[\/2] and obtain a 
contradiction. 

11. Since 0 P = 0, 0 G T and so T ^ f Let a, 6 G T. Since F is commutative, (a — b) p = 
a p —pa p-1 6 + - • • + (—l) i ( p )a p_i 6 i + - ■ ■ + (—l) p-1 pa6 p-1 + ( —1) P 6 P . Also, sincep|( p ), ( p )x = 0 
for all x G F and for all 1 < i < p — 1. Hence, (a — 6) p = a p + ( —1) P 6 P — a~ b since if p = 2, 
then 1 = —1, and if p is odd, then (—l) p = —1. Hence, a — b G T. Suppose 6^0. Then 
( ab~ 1 ) p = a p (b p )~ 1 = ab^ 1 and so ab -1 G T. Since 0,1 G T, T has at least two elements. 
Hence, T is a subfield of F. 

14. Since a0 = 0 = 0a for all a G R, 0 G C(R) and so C(R) / 0. Let a, 6 G C(R). 
Then ax = xb and bx = xb for all x G R. Now x(a — b) = xa — xb — ax — bx = (a — b)x and 
x(ab) = ( xa)b — (ax)b = a(xb) — a(bx) = (ab)x. Hence, a — 6, ab G C(7i). Thus, (7(7?) is a 
subring. 

16. The elements in the center of M 2 (R) are of the form ^ ^ , where a G R. 

23. (i) False (ii) False (iii) False (iv) True (v) True (vi) True. 


where a G R. 


Exercises 11.2.2 (page 306) (Ideals and Quotient Rings) 

_ 6 1 |"o d 1 _ T r a u ] 


1. (i) Clearly I ^ 0. Let 


0 e 


0 d 
0 e 


0 b-d 
0 c ~ e 


0 v 


0 6 
0 c 


0 ^ J 

0 ab + uc 
0 vc 


GT 2 (Z).Then 


0 6 
0 c 


G 7, and 


Hence, 7 is an ideal. Now 


0 6 
0 c 


0 v 


Thus, T 2 (Z)/7 = 


a 0 
0 0 


0 v 


+ 1 = 


a G Z j- 


a 0 
0 0 


0 bv 
0 cv 

+ 1 + 


0 u 
0 v 


+ 1 = 


a 0 
0 0 


2. Z24// = {/, [1] + 7, [2] + /, [3] + /, [4] + /, [5] + 7, [6] + I, [7] + /}. 

3. Since 0 = 0 + 0\/—5 G I, I ^ 0. Let a + 6\/—5, c + dyf ^5 G 7. Then a — b and c — d 

are even and so (a — c) — (b — d) = (a — b) — (c — d) is even. Thus, (a + 6\/—5) — (c + d\/~5) = 
(a — c) + (6 — d)y/— 5 G I. Let x + y^/~5 G Z[V — 5]. Then (a + b\/—5)(x + y\j— 5) — (ax — 
5by) + (ay + 6x)\/—5- Now (ax — 56y) — (ay + bx) = (a — b)x — (a — b)y — 66y, which is even. 
Thus, (a + 6\/—5) (x + yy/^ 5) G 7. Since Z[y/^5] is commutative, (x + yy/^5)(a + 6>/—5) G 7. 
Hence, 7 is an ideal. 

5. Since 0 = 00 G AB, AB ^ 0. Let x,y G AB and r G R. Then x — a l b l 

and y = c j^j> where a z ,Cj G A and bj,d j G 7? for all i and j. Clearly x — y = 

IRiLfuc. 7^o^Lc#fta£uiaZ n 
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X^r=i ai h - YlJLi C 3 ^J e AB. Also, rx = r(^2i =1 a ib 0 = € AB since tch £ A 

for all i, and xr = (X^=i °i^) r = X^=i a i(bi T ) € AB since f>ir € B for all z. Hence, AB is 
an ideal. 

7. Use Theorem 11.2.11 and Exercise 6 (page 306). 

11. (ii) Since m\q and n\q, q £ (m.) and q £ (n) . Hence, (q) C (m) D (n) . Let a £ 
(m.) D (n) . Then m.|a and n\a. Since q is the 1cm of m and n, q\a and so a € (q) . Therefore, 
(m) D (n) C (q) . Consequently, (m) Pi (n) = (q) . 

12 . {0} x {0}, Fi x {0}, {0} x F 2 , and Fi x F 2 are the only ideals of Fi x F 2 . 

13. (ii) Yes. 

16. Z has no zero divisor; ( 6 ) is a nonzero ideal of Z. Since (2 + (6))(3 + ( 6 )) = 6 + ( 6 ) = 
( 6 ) , 2 4- ( 6 ) 7 ^ ( 6 ) , and 3 + ( 6 ) ( 6 ) , it follows that Z/ ( 6 ) has zero divisors. 

18. ann/ = {[ 0 ], [ 10 ]}. 

19. ann/ = {0}. 

21. Let / be an ideal of a regular ring R. Let a £ I. There exists b £ R such that a = aba. 
Since / is an ideal, x = bab £ I. Now axa = a(bab)a = ( aba)ba = aba = a. Hence, / is regular. 

27. (i) False (ii) True (iii) False. 

Exercises 11.3.2 (page 315) (Homomorphisms and Isomorphisms) 

2 . Let a, b,c,d £ Z. Suppose that a = c and b = d. Then a + b — 1 — c + d — 1 and 
a + b — ab = c + d — cd since + and • are well defined on Z. Thus, © and © are well defined 
on Z. Define / : Z —* Z by f(a) = 1 — a for all a £ Z. Clearly / is a one-one function of Z 
onto Z. Now f(a + b) = 1 — (a + b) = (1 — a) + (1 — b) — 1 = (1 — a) © (1 — b) = /(a) © f(b). 
Also, f(ab) — 1 - ab = (1 - a) + (1 - b) - (1 — a)(l — b) — (1 - a) © (1 - b) = f(a) © f (b). 
Thus, / preserves addition and multiplication of (Z,+, •) onto (Z,©, ©). Hence, it follows that 
(Z,©,©) is a ring isomorphic to the ring (Z,+, •). 

3. (ii) If there exists an isomorphism / of R. onto C, then there exists r E R such that 
f(r) = i. In this case f(r 2 ) = i 2 = —1. But /(— 1 ) — —1 since /(1) = 1 and so r 2 = —1. 
However, no such real number r exists, (iii) Yes. 

4. (ii) Yes (iii) No (iv) Ker q c I b,c £Z 

5. No. " ' J 

7. Suppose that there exists an isomorphism / of 2Z onto 3Z. Then 2Z and 3Z are 

isomorphic as additive groups under /. Thus, / maps a generator of the cyclic group 2Z onto 
a generator of the cyclic group 3Z. Hence, /(2) = 3 or /(2) = —3. Suppose /(2) = 3. Then 
/(4) = /(2 + 2) = /(2) + /(2) = 3 + 3 = 6 . Since / also preserves multiplication, /(4) = 
/(2 ■ 2) = /(2)/(2) = 3-3 = 9. However, this is impossible. 

10. No. 

12. 5 = {0,1,..., n — 1} is a subring of R isomorphic to Z n . The isomorphism is given 
by /(*) = [f] for all i £ S. 

13. Let / be a homomorphism of R into R. Since R is a field, either Ker / = {0} or Ker 
/ = R. If Ker / = R, then f(x) = 0 for all x £ R. Suppose Ker / = {0}. Then / is one-one. 
The desired result now follows as in Worked-Out Exercise 2 (page 313). 

16. (i) True (ii) True (iii) False (iv) False (v) True (vi) False. 

Exercises 12.1.2 (page 323) (Ring Embeddings) 

3. Define /* : F -> F' by r {ab~ l ) = /(a.)/( 6)~ 1 for all ab~ l £ F. Then a 6 _1 = ccT 1 
if and only if ad — be if and only if f(ad) = f(bc) if and only if f(a)f(d) = f(b)f(c) if 
and only if /(a)/( 6)“ 1 = /(c)/(d ) -1 if and only if f*(ab~ 1 ) = f*(cd~ 1 ). Thus, f* is a 
one-one mapping. Let a'b ' -1 £ F'. Since / maps R onto R ', there exist a,b £ R, b =£ 0 
such that f(a) = a' and f(b) = b'. Thus, /*(o 6 _1 ) = /(a)/(fe ) -1 = a'b'~ l . Hence, f* maps 
F onto F'. Let ab~ 1 ,cd~ 1 £ F. Then f*(ab ~ 1 + cd~ 1 ) — f* ((ad + ^(bd) -1 ) = f(ad + 
bc)f(bd)~ 1 = (f(a)f(d) + f(b)f(c))f(b)~ 1 f(d)_' = /(a)^fc ) -1 + f(c)f(d)~ l = f*(ab + 
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f*(cd~ 1 ). Also, f*(ab ~ 1 ■ cd~ l ) — f*(ac(bd ) -1 ) = f(ac)f(bd)~ 1 = /(o)/(c)/( 6 ) - 1 /(d ) -1 = 
f(a)f(b)~ 1 f(c)f(d)~ 1 = f*(ab~ 1 )- Suppose g* is an isomorphism of F onto F' such 

that g* - f on R. Then g*(ab~ 1 ) = g* (a)g* ( 6 -1 ) = g*(a)g*(b)~ 1 = /(a)/( 6) _1 = f*(ab~ 1 ). 
Thus, /* = 5 *. Hence, /* is unique. 

6. The field of quotients of Z[i] — | a,b,c,d £ Z, c + di ^ 0} = {p + qi \ p,q £ Q}. 

The field of quotients of Z[\/2] = { I &,b,c,d £ Z,c + d\/2 ^ 0} = {p + g\/2 | 

p,q € Q} 

Exercises 13.1.2 (page 334) (Direct Sum of Rings) 

2. We have A — RA = (R\ © • • • © R n )A = RiA © • • ■ © R n A. Let Ai = RiA for 
i = 1,2,... ,n. Then At is an ideal of Ri since RiAi = Ri(RiA) ~ RiA = Ai. Let g be the 
natural homomorphism of R onto R/A. Then R/A ~ g(Ri) © ■ • • © g(Rn) and g(Ri) — Ri/Ai 
for i = 1, 2,..., n. 

4. Z mn is a cyclic group of order ^vith geneiator [1]- Since gcd(/r/<, u.) — 1, Z m © Z n, 
is a cyclic group of order mn with generator ([1], [1]). Define g : Z mn —> Z m © Z n by g(i[l]) = 
z([l], [1]), where i G Z. Then g is a (additive) group isomorphism of Z mn onto Z m ©Z n - Now 

s([*][7']) = = a(ij[ i]) = [i]) = »([i]. [i])i([i]. [i]) = ff(*[i])5(i[i]) = s(MMbl)- 


Exercises 14.1.2 (page 343) (Polynomial Rings) 

!■ Let EHo aiX 1 , biX 1 £ /[x], where ai, bi £ /. Either m > n or n > m., say, m > n. 

If m > n, let fci = 0 for z = n + 1,. .., m. Then (X]™ 0 0ia;t — Er=o o( ai ~ £ 

/[x]. Let Yl k j =o T i x3 G Then (EjU^XEllo 0 *®*) = E^rCELo 7 ^*-*)® 5 G W 

Similarly, (EHo ° i:r ‘)(Ej=o r J® J ) e -H®]- 

3. (a:) is the set of all polynomials over R with constant term 0. 

4. (i) q(x) = x 2 + x — 1, r(x) = x + 3. (ii) q(x) = x 2 + x + [4], r(x) = x + [3]. 

5. q(x) — x, r(x) = [4]x 3 + [3]x 2 + x + [3]. 

8. ([1] + [2]x)([l] - [2]x + [4jx 2 ) = [1] + [8Jx 3 = [1], 

11. The units of Z[x] are 1 and —1. 

12. The units of Ze[x] are [1] and [5]. 

14. (iii) Use Exercise 9. 

15. (i) The proof is by induction on n. Suppose n = 1. Suppose that ri(xi)’ = 0. 

Then n = 0 for i = 0,1,..., m by the definition of a polynomial. Suppose the result is true 
for n — 1 > 1. Let p(xi,X 2 , • • • ,x n ) = ^2 in * ri 1 ...i 7 ,x \ 1 ... x % £ G R[xi,x 2 , ... ,x n }. Suppose 

that p(x 1 , 12 ,.. • ,Xn) = o. Now p( Xl ,X 2 ,.. . ,X n ) = Ei„, T il-i* x l • • • ®n n -l )®n n - 

Hence, yT i i Ti 1 ,,,i ri x \ 1 .. . x^Tj 1 = 0 for all i n . Thus, by induction ri 1 ..j„ = 0 for all 


j.« ■ j^TL ‘—i for G^ch 'in- 

(ii) Since r ii-i n x *i • • • = Ei„...ti Si i -if and only if = 

Si 1 ...i 7l for all zi, ... , i n , ct is well defined. By Defintion 14.1.13, a maps R[x i, x 2 , ■. ■, x n ] onto 
R[ci, C 2 ,..., c n ]. Since for any two polynomials f(xi, x 2 , ■. ., x n ), g{x \, x 2 ,..., x n ) € 
k(x i,x 2 , ■ • • yXn) = f{x i,x 2 ,... ,x n )+g(x i,x 2 ,..., x n ) implies k(a,c 2 ,... ,c n ) = /(ci,c 2 ,. .., 
Cn ) ~\~g (^*1 ? c 2 , • • •, Cn ) and b,(x i, x 2 ,. ■ • ,Xn) f (,xi, x 2 ,. . . ,Xti)p(xi,X 2 , • • -, in) implies /z(ci, c 2 , 

..., c n ) = /(ci,c 2 , ■ ■ -, c n )g(ci, c 2 ,... ,Cn ), it follows that a preserves + and •. 

16. K[x}/ (f{x)) = {fc(x) + (/(x)) | k(x) G /Cfx]}. Let fc(x) + (/(x)) G K{x}/ (f(x)) . Then 
there exist <j(x), r(x) G K[x] such that k(x) = g(x)/(x)+r(x), where r(x) = 0 or degr(x) < n. 
Thus, k(x) + (/(x)) = (q(x)f(x) + r(x)) + (/(x)) = (q(x)f(x) + (f(x))) + (r(x) + (f(x))) = 
(0+ (/(x))) + (r(x) + (f(x))) = r(x) + (f(x)) . 

17. (i) True (ii) False (iii) True. 


Exercises 15.1.2 (page 352) (Euclidean Domains) 

2. (go + gi y/3) = 1, tq + r \\/3 = 8 — 2\/3. 
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3. q 0 + q 1 i = 3 + 0 i, r 0 + r^i = 0 + If. 

6. Let / : Z -> Z„ be defined by /(a) = [a] for all a E Z. Then / is an epimorphism. 
Use Exercise 5. 

7. (i) True (ii) False (iii) False. 

Exercises 15.2.2 (page 359) (Greatest Common Divisors) 

1 . (i) The associates of 3 — 2 i are 3 — 2 i, —3 + 2i, 2 + 3 i, —2 — 3i. (ii) The associates of 
1 + iy /5 are 1 + iy/ 5, —1 — iy/ 5. (iii) The associates of [ 6 ] in Zio are [ 2 ], [4], [ 6 ], and [ 8 ]. (iv) 

[1] , [2], [3], [4] are the associates of [4] in Z 5 . (v) The associates of [2] + x are [2] + [x] and 
[!] + [%• 

2 . ± 1 . 

3. The units of Z[x] are 1 and — 1 . The associates of 2 + x — 3a ; 2 are 2 + x — 3x 2 and 
—2 — x + 3x 2 . 

5. Units of Z 7 [x] are the nonzero elements of Z 7 . The associates of x 2 + [2] are x 2 + [2], 

[2] x 2 + [4], [3]x 2 + [ 6 ], [4]x 2 + [1], [5]x 2 + [3], [ 6 ]x 2 + [5]. 

7. Since a and b are associates, a — bu and b = aw for some units u and w. Now 
v(a) = v(bu ) > v(b) and v(b) = v(aw ) > v(a). Hence, v(a ) = v(b). 

13. gcd(2 — 7z, 2 + Ilf) = 1; x = — (2 + 4f) and y = —3 i. 

16. Let a + by/2 be a unit such that 1 < a 4- by/2 < 1 + y/2. Then a 2 — 2 b 2 = ±1. That 
is, (a + b\/2)(a — by/ 2) = ±1. Hence, a — by/2 — a ^y 7 > • This implies that — 1 < a — by/2 < 1. 

From 1 < a + by/ 2 < 1 + y/2 and — 1 < a — by/2 < 1, it follows that 0 < 2a < 2 + y/2 , i.e., 
0 < a < 1 + ^. Thus, since a is an integer, a = 1. This implies that 1 < 1 + by/ 2 < 1 + y/2, 
a contradiction. Hence, there is no unit between 1 and 1 + y/2. 


Exercises 15.3.2 (page 365) (Prime and Irreducible Elements) 

1. In Z[iy/E], the only units are ±1 since x + yiy/ 5 E Z[iy/5] is a unit if and only if 

x 2 + 5 y 2 = 1. Let 2 + iy/ 5 = (a + biy/5)(c + diy/5) for some a + biy/5, c + diy/5 E Z[iy/5]. Then 
2 — iy/5 = (a — biy/5) (c — diy/5). Hence, 9 = (2 + iy/ 5)((2 — iy/ 5) = (a 2 + 56 2 )(c 2 + 5d 2 ). Now 
proceeding as in Example 15.3.11, it follows that either a 2 + 5b 2 = 1 or c 2 + 5 d 2 = 1. Thus, 
either a + biy/5 is a unit or c + diy/5 is a unit. Hence, 2 + iy/ 5 is irreducible. We now show 
that 2 + iy/5 is not prime. Since (2-\-iy/5)(2 — iy/5) = 9 = 3-3, (2 + fV / 5)|3-3. Suppose 2 + iy/S 
is prime. Then (2 + f\/5)|3 and so 3 = (2 + iy/5)(a + biy/5 ) for some a + biy/5 E Z {iy/5}. 
Then 9(a 2 + 5b 2 ) = 9. Thus, a 2 + 5 b 2 = 1 and so a + biy/5 is a unit. Hence, a + biy/5 = ±1. 

Suppose a -)- biy/5 = 1. This implies that 3 = 2 + iy/5. Thus, 3 = 2 and y/5 = 0, which is 

absurd. If a + biy/5 = — 1, then we would get 3 = —2 and y/5 — 0, which is again absurd. 
Hence, 2 + iy/5 is not prime. 

2. Suppose 1 + i = (a + bi)(c + di) for some a, b,c,d E Z. Thus, 1 — i — (a — bi)(c — di) 

and so l 2 + l 2 = (a 2 + b 2 )(c 2 + d 2 ), i.e., 2 = (a 2 + b 2 )(c 2 + d 2 ). Hence, (a 2 + 6 2 = 2 and 

c 2 + d 2 = 1) or (a 2 + 6 2 = 1 and c 2 + d 2 = 2). Suppose a 2 + 6 2 = 2 and c 2 + d 2 = 1. Now 

c 2 + d 2 = 1 implies that (c = 0, d ± 1 or c = ±1, d = 0). Thus, c + di is a unit. If a 2 + b 2 = 1 
and c 2 + d 2 = 2, then a + bi is a unit. Hence, 1 + i is irreducible. 

3. Since 3|(2 + iy/ 5)(2 — iy/ 5), 3 / (2 + iy/5), and 3 / (2 — iy/ 5), 3 is not prime in Z[iy/ f 5 . 

5. No. 

9. [3] and [6] are the prime elements. [3] and [6] are also irreducible. 

12. (i) False (ii) False (iii) False (iv) True (v) True. 


Exercise 16.1.2 (page 374) (Unique Factorization Domains) 

1. Since Z is a PID, the result follows by Lemma 16.1.6. 

4. Define N : Z[f-\/^] —+ N by iV(a + iby/ 6) = |a 2 + 66 2 | . As in Example 16.1.4, show 
that Z [z a/ 6] is an FD. Now 10 = (2 + iy/ 6)(2 — iy/5) = 5-2. Show that 5 is an irreducible 
element, but not a prime element, and use Theorem 16.1.10. 

Pii.>lc 7^o^Lc#fta£uiaZ 
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6. (i) False (ii) False. 


Exercises 16.2.2 (page 380) (Factorization of Polynomials over a UFD) 

2. Suppose /(x) is not irreducible in Q[cc], but /(x) is irreducible in Z[x], By Exer¬ 
cise 1 (page 380), /(x) is primitive. Then by Lemma 16.2.8, f(x) is irreducible in Q[x], a 
contradiction. Hence, /( x) is not irreducible in Z[x). 

3. Show that the ideal I = (x,y) is not a principal ideal in Q[x,y]. 


Exercises 16.3.2 (page 386) (Irreducibility of Polynomials) 

3. In Z 2 [x], x 2 + [2]x + [6] = x 2 = x ■ x. Hence, the polynomial x 2 + [2]x + [6] is reducible 
in Z2 [x]. Now x 2 + 2x T 6 has no roots in Q and so it is irreducible in Q. Thus, x 2 4- 2x 4- 6 
is irreducible in Z. 4. For x 2 + 2x + 6 E Z[x]. Let p = 2. 

6. As in Example 16.3.6, show that /(x) has no roots in Q. 

8. First show that g(x) = 15x 2 4- 5x — 6 is irreducible in Q[x], 

9. Consider /(x — 1) and use Eisenstein’s criterion. 

11. Use Eisenstein’s criterion. 

12. x 2 + [l], x 2 + [l]x + [2], x 2 +[2]x +[2], [2]x 2 + [2], [2]x 2 +x + [l], and [2]x 2 + [2]x + [1]. 

i5. (ii) \p{p - 1). 

Exercises 17.1.2 (page 399) (Maximal, Prime, and Primary Ideals) 

1. Maximal ideals: {[0], [5]}, {[0], [2], [4], [6], [8]}. Prime ideals: {[0], [5]}, {[0], [2], [4], 
[6], [8]}, and Z 10 . 

3. Only one maximal ideal: {[0], [p], [2p],.. ., [(p — l)p]}. 

4. I is a maximal ideal. 

7. Let I be a nonzero proper ideal of R such that I is prime. Let J be an ideal of R 
such that I C J. There exists a E J such that a £ I. Now a(l — a) = 0 E / and a I. Hence, 
1 — a & I C J. Also, a E J and so 1 E J. Thus, J = R and so I is a maximal ideal. The 
converse follows by Theorem 17.1.7, since every Boolean ring is commutative. 

13. Let /(x),y(x) E K[x\. Then 0 a (/(x) + g{x)) = f(a) + g{a) = <p a (f(x)) + <p a (g(x)) 
and <p a (f(x)g(x)) = /(a)y(a) = 0 a (/(x))</> a (y(x)). Therefore, 0 a is a homomorphism. Let 
b E K. Now f(x) = b — ax + x 2 E K[x] and <jf>a(/(x)) = b — a 2 + a 2 = b. Thus, (p a is an 
epimorphism. Hence, AT[x]/Ker <p a — K. Since A is a field, A[x]/Ker <p a is a field and so Ker 
4 > a is a maximal ideal. 

15. Clearly D a I a is an ideal. Let ab E n Q / Q . If either a E la for all a or b E I a for all 
a, then either a E C\ a I a or b E n Q J Q . Suppose a, b £ fl Q / Q . Then there exist a and /? such 
that a £ I a and b £ Ip. Since {I a } is a chain, either la C Ip or Ip C I a . To be specific, let 
la Q Ip- This implies that b £ I a - Since ab E la, and I a is prime we must have either a E I a 
or h E I a , a contradiction. Therefore, either a E fUla or b E fl a / Q . Consequently, n a I a is a 
prime ideal. 

19. (ii) (x + 2)(x + 3) E (x, 6) , but neither (x + 2) n E (x, 6) nor (x + 3) n E (x, 6) for any 
positive integer n. Hence, (x,6) is not primary. 

24. (i) Za has only one maximal ideal, namely, {[0], [2], [4], [6]} and so Za is a local ring. 

30. (i) True (ii) True (iii) True (iv) False (v) True (vi) False (vii) False (viii) True. 


Exercises 17.2.2 (page 405) (Jacobson Semisimple Ring) 

1. Suppose the ring Z n , n > 1, is J-semisimple. There exist prime integers pi,f 2 , ■ ■ • ,Pk 
such that n = P 1 P 2 • • -pk- We claim that pi,p 2 , ■ ■ ■ ,Pk are all distinct. Suppose there exist 
1 < i < j < k such that pi = Pj = p (say). The mapping (3 : Z —*■ Z n defined by j3 (m) = [m] 
is an epimorphism of rings and Ker (3 = nZ. Now Ker (3 C fZ, where t > 1 and t. divides n. 
Hence, the ideals of Z n are of the form I t = {[0], [t], [2 1],. ..}, where t > 1 and t divides n. 
Note that /tj C I (2 if and only if t 2 divides t\. Hence, the maximal ideals are given by I Pi 
for i = 1,2 ,k. Now if t = ppip 2 • • -pi-iPi+i - ■ - pk, then t is divisible by each pt and t\n. 
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Therefore, I t C I Pi for all i = 1,2,...,fc. Thus, It C C\I Pi = radZ n and so radZ n ^ {0}, a 
contradiction. Hence, n is square free. If n is square free, then from the above argument we 
can prove that Z n is J-semisimple 

2. Yes. 

3. Let R be a PID together with the given properties. Each maximal ideal of a PID is 
generated by a prime element and each prime element of R generates a maximal ideal. Hence, 
radi? = n p is a prime eiment of r (p) ■ Let a E rad R. Then a E (p) for all prime elements p of 
R. Hence, a has an infinite number of nontrivial divisors. Since R is also a UFD, it follows 
that a = 0 or a is a unit. But radii does not contain any units. Hence, a = 0 and so R is 
J-semisimple. 

6. (i) True (ii) True (iii) False (iv) True. 

Exercises 18.1.2 (page 419) (Noetherian and Artinian Rings) 

2. Consider the field Q of rational numbers. The polynomial ring Q[x] is Noetherian. 

Let R = { /(x) E Q[x] | the constant term of /(x) is an integer}. Then R is a subring of 
Q[x). We show that R is not Noetherian. For this, let I T = (2 r x) for t = 0, —1, —2,.... Then 
Jo C /-i C 1-2 C - ■ • is an infinite strictly ascending chain of ideals of R. Hence, R is not 
Noetherian. 

3. Z(p“). 

10. (i) Since I E T\ y - (j>- Since R is right Artinian, T\ has a minimal element, say 
Iq. Now Iq C Iq. Since Io is minimal, either Iq — {0} or Iq — Iq. If Iq = {0}, then Jo is 
a nilpotent right ideal, a contradiction. Hence Io = Io- (ii) Since Io — Io, we find that 
Jo E T and so T ^ 0. Now R is right Artinian. So T contains a minimal element, say I\. 

Then I\Io ^ {0}. This implies that there exists a nonzero element u E Ii such that ulo is 
a nonzero right ideal of R and ulo C I\ C I 0 . Also, (ulo)lo = ulo = ulo y^ {0} shows that 
ulo £ T. Since I\ is minimal in T, it now follows that ulo — I\• (iii) ulo = Ji implies that 
ua = u for some a E Jo- Since u 0, we find that a ^ 0. Also, u = ua = ua 2 = ua 3 = ■ • • 
shows that a n ^ 0 for all n > 1. Thus, I contains nonnilpotent elements. Hence, I is not 
a nil right ideal. (iv) Let T — {r E Io | ur = 0}. Then T is a right ideal of R such 
that T C Iq. Since ulo = I\ {0}, it follows that T C Iq. Since Io is a minimal element 
in T\, it follows that T is a nilpotent right ideal. Also, u(a 2 — a) = 0 (a E Io, see (iii)) 
implies that a 2 — a E T. Hence, a 2 — a is nilpotent. Thus, there exists a positive integer n 
such that (a 2 — a) 71 = 0. This implies that a 71 — a n+1 f(a ) for some polynomial f(x ) in Z[x]. 

Thus, a n = a(a m /(a)) = a 71 + 2 f(a) 2 . Proceeding in this way, we obtain a n = a 2 n f(a) n . Let 
e = a n f (a) n . Now e E I and e 2 = a 2n f(a) 2n = (a 2n f(a) n )f(a) n = a n f(a) n = e. Clearly 

0, otherwise, e = 0 implies a n — 0. Hence, I contains a nonzero idempotent. 

13. (i) True (ii) False (iii) True. 

Exercises 19.1.2 (page 431) (Modules and Vector Spaces) 

1. (i) Yes (ii) Yes (iii) Yes (iv) No. 

6. (ii) Let x E (A + B) D C. Then x E C and x = a + b for some a E A and b E B. Now 
b — x — a € C + A = C. This implies that b E B fl C. Hence, x = a + bGA + (BC\C). Thus, 

(A + B)flC C A + (Bfl C). On the other hand, let x E A + (B fl C). Then x = a + b for 
some a E A and b E B 0 C. Then a + b E A + B and a + b£A + C = C. This implies that 
x E (A + B)flC. Thus, A + (BC\C) C (A + B)(lC. Consequently, A + (B fl C) = (A + B) fl C. 

8. Suppose M is a simple module and let 0 / m E M. Then Rm ^ {0} and Rm. 
is a submodule of M. Hence, M = Rm and so M is generated by m. Conversely, assume 
that Rm. = M for all nonzero elements m E M. Let T be a nonzero submodule of M. Let 
0 / aET. Then Ra ^ {0} and Ra C T. Thus, M — Ra C T C M and so T — M. Hence, M 
is simple. 

16. Let W be a subspace of V. Let dim V = n. Then dim W = m < n. Let {uji, W 2 ,..., w m } 
be a basis of W. Now {w\,W 2 , ■ ■. ,U) m } is a linearly independent set in V and hence can be 
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extended to a basis of V, say {w\,W 2 , ■.., zc m , w m +i,. .. ,w n }. Let U be the subspace of V 

generated by {w m +i, ■ ■ ■, tCn}- Now W + U C V. Let 2 G F. Then x = a\W\ H-+ a m -ir/ m -|- 

a m +iui m+ i + • ■ ■ + a n Wn for some a x G F, 1 < i < n. Since a\u>i + ■•• + a-mW-m G W 
and dm+iWm+i + • • • + a n tu n G V, x G W + U. Hence, V — W + U. Let x G W D U. 
Then x — b\Wi + • • ■ + bmW m and x = bm+iWm+i + ■ • • + 6 n m„ for some bi G F. Thus, 
biwi + - • • + b-mWm. = b m+ iWTn+i + ■ • • + b n w n , which implies that b±wi + ■ • • + bmWm + 

(—5 m+ i)w m+ i H-b (—b n )w n = 0. This implies that bi = 0, 1 < i < n. Thus, x = 0. Hence, 

W PI U = {0}. Consequently, W is a direct summand of V. 

Exercises 20.1.2 (page 437) (Full Matrix Rings) 

1. {0} and M n (R). 

Exercises 20.2.2 (page 444) (Rings of Triangular Matrix) 

2. Since R is of infinite dimension over Q, R is not an Artinian right Q-module. Hence, 

and so S ^ is not right Artinian by Theorem 20.2.3. 

K 

Exercises 21.1.2 (page 457) (Algebraic Extensions) 

3 (i). Suppose that 7r 2 is algebraic over Q. Then [Q(tt 2 ) : Q] < oo. Now [Q(tt) : 
Q(7r 2 )] = 2 and so [Q(7 t) : Q] < oo. Thus, 7r is algebraic over Q, a contradiction. Hence, 7r 2 
is transcendental over Q. 

5. Suppose that 7r is algebraic over Q(\/2). Then [Q(\/2)(7r) : Q(V2)] < oo and since 
[Q(V2) : Q] < 00, (Q(V2)M : Q] < oo. Hence, every element of Q(\/ 2 )( 7 r) is algebraic over 
Q. However, this is impossible since 7r is transcendental over Q. 

8. [Q(^5) : Q] =3. 

9. [Q(V / 3, y/5) : Q] =4. 

10. Suppose that i 2 — 5 is not irreducible over Q(v^). Then there exist polynomials 
x — a, x — b G Q(V2) such that x 2 — 5 = (x — a)(x — b) = x 2 — (a + b)x + ab. Hence, a + b = 0 
and ab = —5. Thus, a 2 = 5. Hence, \/5 G Q(\/2). Thus, there exists p,q G Q such that 
\/5 = q + p\/ 2. Hence, 5 = q 2 + 2pq\/2 + 2p 2 and so \/2 G Q, a contradiction. 

11. x 4 -4z 2 -1. _ 

13. [Q(V2, \/5) : Q] = 4. (1, V2, V5, \/l0} is a basis of Q( v / 2, V5)/ Q. 

14. By Theorem 21.1.18, every element of K(c)/K is algebraic over K. Now /(c) G K(c). 
Thus, /(c) is algebraic over K. 

23. (i) True (ii) False (iii) False. 

Exercises 21.2.2 (page 467) (Splitting Fields) 

4. S = Q(\/3, iy/3), [5 : Q] = 6, and {1, \/?>, -^9, iy/3, iy/3\^3, i V3-^9} is a basis of S 
over Q. 

5. [S : Z 5 ] = 2 and {1, A} is a basis of S/ Z 5 , where A is a root of z 2 + x + [1]. 

6. S — Z 2 , and {[1]} is a basis of S/Z 2 . 

7. S = Q(\/2, \/5), [5 : Q] = 4 and {1, \/2, \/5, V / 10} is a basis for S/Q. 

8. [Q(-5 + #i):Q] = 2. 

12. The proof is by induction on n. Suppose that n — 1. Then f(x) = ax + b for some 
a, b G K, a ^ 0. Hence, S = K{— £) and [5 : K] = 1 < l! Assume the result is true for 
all polynomials of degree < n — 1. Suppose deg/(x) = n. Let A be a root of f{x). Then 
[K( A) : K] < n and f(x) = (2 —A)p(z) over K{ A) and deg g(x) = n — 1. Let S be the splitting 
field of g(x) over K( A). Then [5 : AT(A)] < (n — 1)! by the induction hypothesis. Now S is 
the splitting field of f(x ) over K and [5 : K] = [5 : K(X)][K(X) : K] < (n — 1)! ■ n = n\ 

15. (i) False (ii) True. 

Exercises 22.1.2 (page 489) (Multiplicity of Roots) 
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5. (i) separable (ii) separable. 

6 . (i) e = 3 and no — 2. (ii) e = 0 and no = 128. (iii) e = 3, no = 1. 

8 . Consider p(x) in Example 22.1.16. p(x) is irreducible over K(t) by Eisenstein’s 
criterion. D x (p{x)) = 0 and so p(x) is inseparable over K(t). Now e = 1 and no = 2. Thus, a 
root of p(x) is inseparable, but not purely inseparable over K{t). 

Let p{x) = x 2p + vx p + u in Example 22.1.39. By Worked-Out Exercise 6 (page 456), 
p{x) is irreducible over K. Now D x (p(x)) = 0 and so p(x) is inseparable over K. Also, e = 1 
and no = 2 . Thus, any root of p(x) is inseparable, but not purely inseparable over K. 

10. Since [F : K] < oo, F/K is algebraic. Let a G F. Let p(x) be the minimal polynomial 
of a over K. Then [if (a) : K ] divides [F : K]. Hence, p does not divide [K(a) : K], Thus, the 
exponent of inseparability of p(x) is 0. Hence, a is separable over K. Thus, F/K is separable. 

13. Z p is the smallest subfield of K. Now for all x G Z p \{0}, x p 1 — 1 and so x p = x. 
Hence, (Z p ) p = Z p and so Z p is perfect. 

15. (i) False (ii) True (iii) True. 

Exercises 23.1.2 (page 497) (Finite Fields) 

2. In order to construct a field with 9 (= 3 2 ) elements, we find an irreducible polynomial 
of degree 2 over Z 3 . For this, consider the polynomial x 2 + [l] over Z 3 . Since [0 ] 2 + [1] 7 ^ [0], 

[l ] 2 + [1] 7 ^ [0], [2 ] 2 + [l] 7 ^ [0], Z 3 contains no roots of x 2 4 - [l]. Hence, x 2 + [1] is an 

irreducible polynomial of degree 2 over Z 3 . This implies that F — Z 3 / (x 2 + [1]^ is a field. 
Since [F : Z 3 ] = 2, F has nine elements by Theorem 23.1.1. 

6 . Let [F : Z p \ = m. Then |F| = p m by Theorem 23.1.1. Hence, n = m. 

7. Let F be the splitting field of x 3 — x over Z3. Then |F| = 3 2 and [F : Z3] = 2. Since 

every element of F is a root of a: 3 — x by Theorem 23.1.2 and since [F : Z3] = 2, F = Z p (c) 
for any root c of g(x), where g(x) G Z 3 [^] is such that x 3 ~ — x = x(x — [l])(x — \2\)g(x). 

9. By Theorem 23.1.1, F ~ GF(p Tn ) for some m and clearly m < n. Now n = [ GF(p n ) : 
Zp] = [GF(p n ) : F][F : Z p ] — [GF(p n ) : F]m. Thus, m\n. 

Exercises 24.1.2 (page 502) (Normal Extensions) 

1. (ii) R is not a normal extension of Q. 

2. (ii) No. Consider L = Q(v^) over Q. 

Exercises 24.2.2 (page 518) (Galois Theory) 

1. G(C/R) consists of the identity automorphism and the automorphism a of C/R such 
that 0 ( 2 ) — — 1 . 

2. (i) [F : Q] = 6, [N: Q] = 12, and G(N/Q) ~ Z 2 x S3, (ii) [F : Q] = 6, [N : Q] = 12, 
and G{N/ Q) ~ Z 2 x S3. 

5. G(S/Q) ~ S 3 , x 3 — x — 1 has one real root, say T\, and two complex roots, say r 2 and 
T3. The following table defines G(S/ Q). 



e 

a 

0 

af3 

Pa 

a/3a 

T\ 

r\ 

T\ 

r 2 

T3 

T2 

r 3 

t 2 

T2 

r 3 

T\ 

ri 

r 3 

r 2 

T3 

T3 

T2 

T3 

T2 

ri 

T\ 


The proper subgroups of G(S/ Q) are Hi = {e,a}, H 2 = {e,/3}, H 3 = {e,a/3a}, Hu = 
{e, a(3, a/3), and the corresponding intermediate fields of S/ Q are L\ = Q(ri), L2 = Q(i'3), 
L 3 = Q(r 2 ), L 4 = Q(d). 

6 . (i) Z 2 x 1 S 3 . (ii) Z 2 . 

10. The intermediate fields of Q(i,V / 7)/Q are Q, Q(i, y/7), Q(i), Q(v/7), and Q (iy/f). 
12. |G(F/Q)| = 8. 
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13. The Galois group G of F over Q is isomorphic to Z 2 x Z 2 . All the intermediate fields 
of Q(\/3, VTI)/Q are normal extensions of Q in Q(\/3, \/Il). 


Exercises 24.3.2 (page 527) (Roots of Unity and Cyclotomic Polynomials) 

1. Z 2 . 

2. Both are isomorphic to Z 2 - 

4. Show that $ 2 n{x) — $n( —x) - 

5. Let 4/(x) = . If a; is a primitive pn th root of unity, then w p is a primitive nth 

root of unity. Therefore, a; is a root of 4> n (x p ). Hence, is a monic polynomial in Q[x] 

of degree <p{n). Thus, so is ^(x). If 8 is a primitive nth root of unity, then so is S p since 
p and n are relatively prime. Therefore, 6 is a root of $ n (x p ). Thus, 6 is a root of 'I'(x). 
Consequently, 'J'(x) = 4> n (x). 

6. x 2 + 1. 

10. By Exercise 7 (527), F contains a primitive nth root of unity, say u>. Now u> r is a 
primitive nth root of unity if and only if , J r generates (u) if and only if r and n are relatively 
prime. 


Exercises 24.4.2 (page 535) (Solvability of Polynomials by Radicals) 

1. Consider x 3 + |x + 3. Then 6 = 0, c = d = 3. Now p = |, q = 3, s = 
yj— | + \/^ + | = \J~ | + § y/J, and t = yj— § — § \/\■ The roots of 2x 3 + 9x + 6 
are s + t, us + Lu 2 t, and ui 2 s + cut, where a; is a cube root of 1, u> ^ 1. 

4. (i) x 2 + x 2 + x 2 = (x 1 + x 2 + X 3) 2 — 2(xiX 2 + xix 3 + x 2 X 3 ). (ii) (xi — x 2 ) 2 (xi - 
X 3 ) 2 (x 2 — X 3) 2 = (xi +x 2 + X 3 ) 2 (xix 2 + X 1 X 3 + X 2 X 3 ) 2 — 4(xix 2 + X 1 X 3 + X 2 X 3 ) 2 — 4(xi + x 2 + 
X 3) 3 (X 1 X 2 X 3 ) — 27(xiX 2 X3) 2 + 18(xi + X 2 + Xs)(xiX 2 + X 1 X 3 + x 2 X 3 )(X 1 X 2 X 3 ) . 

6 . A 3 . 

7. Show that the Galois group of /(x) is isomorphic to S5. The equation /(x) = 0 is not 
solvable by radicals. 


Exercises 25.1.2 (page 548) (Geometric Constructions) 

4. Prove that 2 cos ^ is a root of the polynomial x 3 +x 2 — 2x — 1 and that this polynomial 
is irreducible over Q. 

5. Use Example 25.1.21 and Worked-Out Exercises 2 (page 547) and 5 (page 548). 

6. In the proof of Theorem 25.1.19, we see that an angle of 9° can be trisected if and 
only if the polynomial 4x 3 — 3x — cos# is reducible over Q(cos#). Since cos90° = 0, we have 
that an angle of 90° can be trisected. 


Exercises 26.1.2 (page 557) (Binary Codes) 

4. 0 e C and so C 7^ <fi. Let cT.ci e C. Then (cl — C 2 )K — c{K — C 2 K = 0—0=0. 
Hence, cT — ci" £ C. Hence, C is a subgroup of B n . 
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0000M 

— 

0000000 

0001 M 

= 

0001011 

0010M 

— 

0010101 

0011 M 

~ 

0011110 

0100 M 

= 

0100110 

0101 M 

— 

0101101 

0110 M 

= 

0110011 

0111 M 

= 

0111000 

1000M 

= 

1000111 

1001 M 

= 

1001100 

1010 M 

= 

1010010 

101 IM 

— 

1011001 

1100 M 

= 

1100001 

1101 M 

= 

1101010 

1110 M 

— 

1110100 

HUM 

— 

1111111 


7. (i) The code can detect one error. The code cannot correct every single error. 

8 . 


c 

000000 

001011 

010101 

011110 

100110 

101101 

110011 

111000 

100000 + c 

100000 

101011 

110101 

111110 

000110 

001101 

010011 

011000 

010000 + c 

010000 

011011 

000101 

001110 

110110 

111101 

100011 

101000 

001000 + c 

001000 

000011 

011101 

010110 

101110 

100101 

111011 

110000 

000100 + c 

000100 

001111 

010001 

011010 

100010 

101001 

110111 

111100 

000010 + c 

000010 

001001 

010111 

011100 

100000 

101111 

110001 

111010 

000001 + c 

000001 

001010 

010100 

011111 

100111 

101100 

110010 

111001 

100001 + c 

100001 

101010 

110100 

111111 

000111 

001100 

010010 

011001. 


001111 is decoded as 001011; 101010 is decoded as 001011; 111110 is decoded as 011110. 
12. s = 4. K is (2 4 - 1) x 4 or 15 x 4. 


K = 


' 0001 ' 
0010 
0011 
0100 
0101 
0110 
0111 
1000 
1001 
1010 
1011 
1100 
1101 
1110 
1111 


Exercises 26.2.1 (page 570) (Polynomial and Cyclic Codes) 
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"110 1- 
0 0 0 1 

'111110 0 1 1111 

2. G= 0111110 .3. K= 1000 

. 0 0 1 1 1 1 1 J 0100 

0 0 10 
. 0 0 0 1 . 

5. Suppose (/(x)) C ( h,(x )) . Then there exists q(x ) such that h,(x)q(x) — f(x). Hence, 
h(x)q(x) = /(x) + x TL — 1. Now h(x) |(x n — 1) and so there exists m.(x) E Z 2 [x] such that 
h(x)m(x) = x n — 1. Thus, h(x)q(x ) = /(x) + h,(x)m(x ) and so h(x)(q(x) — m(x)) — /(x). 
Hence, /i(x)|/(x). 

7. The proof follows from the proof of Theorem 26.2.6 since every element of C is of the 
form x l g(x) and C is closed under addition. 

Exercises 26.3.1 (page 573) (Bose-Chauduri-Hocquenghem Codes) 

2. 1 = 0 + 0 4 ^ 0 and 1 + 1 + l 4 ^ 0. Hence, if 1 + x + x 4 factors nontrivially over Z2, it 
must be a product of two quadratics. Suppose 

1 + x T x 2 = (a + bx + cx 2 )(d + ex + fx 2 ), 

where a, b, c, d,e, f E Z2. Then 1 = ab, 1 = ae + bd, 0 = af + be + cd, 0 = 6/ + ce, 1 = cf. 

Thus, a — d — c~f — 1, 1 = e + b, 0 = 1 +6e+1,0 = 6 +e. This is impossible. Hence, 

1 + x + x 4 is irreducible over Z2. 

Exercises 27.1.2 (page 582) (Affine Varieties) 

2. Show that (x 1 ,y^ C I(V(x l , y -')). 

4. x + 1. 

Exercises 27.2.2 (page 600) (Grobner Bases) 

1. (i) multideg(/) = (4,1,2). (ii) LC(/) = -5. (iii) LM(/) = x 4 yz 2 . (iv) LT(/) = 
—5 x 4 yz 2 . 

2. 01 — x 2 y — xy + y, = — y and r = —xy 3 + x 2 y — xy — y. 

3. a 1 = x 3 y + x 3 + 1, a2 = 0 and r = 2x 3 y + x 3 + y + 1. 

4. S(f, g) = -2 xyz 2 + x 2 y 3 z 2 - x 2 yz 2 . 

5. J = (x 2 ) , Jo = {0}, Ji = {0}, J 2 = (x 4 ) , J 3 = (x 4 ) , J 4 = (x 3 ) . 

11. {x 2 — y, yz} is a reduced Grobner basis for I. V (/) = {(x, x 2 ,0) | x E R} U {(0, 0, z) 
| z£R}. 


“Public. 



Index 


A 

Abel, Niels Henrik, 82 
Abelian group, 59 

fundamental theorem of finite, 251 
fundamental theorem of finitely gen¬ 
erated, 262 
ACC, 406 

action of groups, 172 
affine variety, 574-575 
algebraic closure, 470 
algebraic element, 447 
algebraic field extension, 453 
algebraically closed field, 469 
algebraically independent, 341 
alternating group, A n , 94 
Artin, Emil, 491 
Artinian ring, 412 
left, 412 
right, 412 

ascending chain condition for principal 
ideals, 368 
associate, 353 
associated prime ideal, 394 
automorphism 

of groups, 144 
of rings, 309 

B 

basis, 423 

Grobner, 592 
minimal Grobner, 597 
reduced Grobner, 598 
standard, 592 
betti number, 266 
binary operation, 52 
associative, 52 
closed under, 52 
commutative, 52 
binary symmetric channel, 551 


Boolean ring, 284 
Burnside theorem, 175 

c 

Cartesian product, 4, 47 
Cauchy’s theorem, 196 
Cauchy, Augustin-Louis, 98 
Cayley’s theorem, 149 
Cayley, Arthur, 180 
center 

of groups, 101 
of rings, 271 
centralizer, 190 
chain, 32 

factors of, 223 
length of, 223 
normal, 223 
subnormal, 223 
chain condition 
ascending, 406 
descending, 406 
characteristic 

of a ring, 278 
subgroup, 165 

Chinese remainder theorem, 30 
Chinese remainder theorem for rings, 
333 

circle, 539-540 
class equation, 192 
code 

BCH, 571 

distance between, 554 
group, 555 
Hamming, 553 
polynomial, 558 
weight of, 554 
codeword, 549 
commutative 
group, 59 





INDEX 


629 


ring, 270 
commutator, 232 
subgroup, 232 
comparable, 32 

complete direct sum of rings, 326 
composition of functions, 43 
congruence modulo n, 23 
conjugate class, 191 
conjugate element, 464 
constructive 
circle, 544 
line, 544 

line segment, 544 
point, 544 
real number, 544 
constructible real number, 542 
content of a polynomial, 375 
coset, 

left, 116 
right, 116 
cover, 34 

cyclic linear code, 561 
cyclic module, 423 
cyclic shift, 561 
cyclic structure, 97 
cyclotomic extension, 521 
cyclotomic polynomial, 521 

D 

DCC, 406 
decoding 

error, 551 
failure, 551 
scheme, 549 
decoding algorithm 
complete, 551 
incomplete, 551 
decoding table, 556 
Dedekind, Richard, 317 
degree of a polynomial, 337 
degree of inseparability, 478, 487 
degree of separability, 478, 486 
DeMorgan’s law, 5-6 
dihedral group, 166, 215 
dimension, 428 
direct sum of rings, 327 
direct summand, 433 


disjoint permutations, 89 
divide, 11, 353 
division algorithm, 338 
division ring, 274 
divisor, 11, 353 

common, 11, 354 
greatest common, 11, 354 
domain 

Euclidean, 345 
factorization, 367 
integral, 275 
principal ideal, 347 
unique factorization, 369 
double coset, 126 

E 

Eisenstein’s irreducibility criterion, 383 
element 

algebraic, 447 
associate, 353 
centralizer of an, 109 
conjugate, 190, 464 
fixed, 175 
fixed by, 504 
idempotent, 279 
identity, 53, 59, 271 
image of, 40 
inseparable, 474 
integral power of an, 67 
inverse of an, 59 
invertible, 273 
irreducible, 360 
order of an, 68 
preimage of, 40 
prime, 360 
primitive, 479 
purely inseparable, 481 
reducible, 360 
regular, 285 
relatively prime, 360 
separable, 474 
separable algebraic, 474 
transcendental, 447 
unit, 273 

elementary divisors, 252 
elementary symmetric functions, 533 
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elementary symmetric, 533 
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G 
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generator, 447 
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isomorphic, 144 
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natural, 144, 312 
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trivial, 141 
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nil, 303 
nilpotent, 303 
nontrivial, 296 
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right, 295 
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algebraic, 372 
division algorithm, 10 
partition of an, 254 
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of groups, 144 
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M 
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right, 422 
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monic polynomial, 337 
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coefficient of, 574 
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